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tion has been ascribed to numer- 
ous and varied factors. The literature 
embraces the fields of psychology, psy- 
chiatry, physiology, psychoanalysis, and 
sociology with the investigations orient- 
ed toward an evaluation of cultural fac- 
tors, personality factors, or the rela- 
tionship between personality and envi- 
ronmental forces. The most frequent 
point of attack has involved clinico-de- 
scriptive analysis of intrapsychic fac- 
tors relative to personality structure 
and content. Although the literature re- 
veals a basic core of agreement on the 
passivity and introversive nature of the 
alcoholic personality, the results re- 
ported in previous studies of chronic 
alcoholic patients reflect a striking mul- 
tiplicity of conclusions, which have 
prompted Landis [20] to emphasize the 
need for more adequately controlled 
investigations. 

There would appear to be two expla- 
nations for the diversity of opinion. 
First, a very large number of the per- 


| ws ETIOLOGY of alcoholic addic- 


e 
‘The writer acknowledges the suggestions 


and criticism of Professor William A. Hunt 
of Northwestern University. 





By SEYMOUR G. KLEBANOFF 


NORTHWESTERN UNIVERSITY 


lil 









sonality studies have been of a qualita- 
tive or descriptive nature and reflect 
the theoretical bias of the investigator. 
These clinico-descriptive studies have 
stressed latent homosexuality [34], la- 
tent heterosexuality [40], social inferi- 
ority and the need for social approval 
[7, 41], social and cultural conditioning 
[89], and individual intra-psychic per- 
sonality adjustment [26]. Second, chron- 
ic alcoholism is an extremely general 
and often amorphous clinical entity 
which is itself highly heterogeneous and 
requires accurate and careful definition. 
As Cimbal [5] has pointed out, chronic 
alcoholism appears to subsume several 
clinical entities which have in common 
only the fact that excessive use of alco- 
hol occurs in the clinical history. One 
subgroup may be termed symptomatic 
chronic alcoholism. In such cases, the 
addiction constitutes the dominant 
symptom or complaint in a basic malad- 
justment of neurotic quality. Meyer 
[24] cites this reaction as the most fre- 
quent type of chronic alcoholism. The 
symptomatic chronic alcoholic patient 
differs from other alcoholics in that he 
seeks relief from his addiction, mani- 
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fests an overlay of surface anxiety con- 
cerning his alcoholism and, as Durfee 
[8] has reported, reveals strong con- 
scious feelings of guilt over his asocial 
mode of adjustment. It is necessary to 
differentiate such symptomatic chronic 
alcoholism from other clinical entities 
manifesting alcoholic addiction. Pa- 
tients who present long standing his- 
tories of diverse and generalized psy- 
chopathy in which chronic alcoholism 
represents only one segment of the total 
maladjustment would appear to fall in 
a separate category. The term reactive 
chronic alcoholism would seem to char- 
acterize still another group of patients 
in whom the addiction develops upon 
some immediate and specific environ- 
mental stress at a relatively conscious 
and superficial level. Finally, one finds 
addiction in individuals described by 
Jellinek [16] as “poverty drinkers” and 
by Landis [20] as “the down-and-out 
Bowery Bums.” Such patients may rep- 
resent chronic states of reactive alcohol- 
ism, or more probably psychopathic per- 
sonalities with alcoholism. When any of 
the above clinical groups progress to the 
point of alcoholic psychosis, one is of 
course dealing with still another clinical 
entity, which in turn must be distin- 
guished from nonalcoholic psychosis 
with secondary alcoholism. 

Many studies have treated chronic 
alcoholism as a homogeneous entity, 
classing it as an essential psychoneu- 
rotic reaction which constitutes an es- 
cape from the challenge of reality [1, 
17, 24]. Others refer to a general per- 
sonality inadequacy or psychopathy 
which includes alcoholic addiction as a 
portion of the symptom-complex [9, 15, 
18, 25]. In contradistinction, several 
investigators conclude that no specific 
or characteristic personality pattern can 
be detected, and that chronic alcoholism 
remains an extremely heterogeneous en- 
tity from the point of view of etiology 


as well as personality structure [10, 2) 
36, 38]. The lack of agreement in these 
personality studies reflects clearly the 
need for more rigid definition and ey. 
perimental] analysis of the various cliyj. 
cal types of chronic alcoholism. 

While the qualitative or descriptive 
studies dominate the literature, ther 
have been some experimental studies jy 
recent years which have employed psy. 
chological test techniques. Wittmay 
[42] has reported results obtained with 
the use of various personality inventor. 
ies and stresses inadequate intraindi. 
vidual adjustment as the core of the 
problem. Studies employing projective 
techniques have been reported by Roe 
[30] who utilized both the Rorschach 
and Thematic Apperception tests, and 
by Billig and Sullivan [8, 4], Halper 
[11], and Seliger and Cranford [37]. 
Roe was unable to find significant differ- 
ences between excessive, moderate, and 
social drinkers, although the Rorschach 
test indicated that the excessive drink- 
ers were highly sensitive, lacking in ag- 
gressiveness, and showed more general- 
ized signs of maladjustment. The Ror- 
schach studies of Billig and Sullivan, 
and of Halpern emphasized underlying 
passivity, immaturity, tendency toward 
environmental withdrawal with strong 
introversive orientation, weak and un- 
stable affective control, and unsuccess- 
ful efforts to develop a secondary extro- 
versive mode of compensation. 

Lewis [2], Landis [20], and Cranford 
and Seliger [6] have emphasized the 
need for more controlled studies of per- 
sonality factors in chronic alcoholism. 
The present investigation attempts to 
avoid systematic bias by approaching 
the problem objectively by means of a 
quantitative analysis of phantasy pro- 
duction obtained with the Thematic Ap- 
perception Test (TAT) in a group of 
chronic alcoholic patients. The second 
problem, that of the heterogeneity of the 
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alcoholic group, is controlled through 
ithe careful selection of a homogeneous 
| croup of alcoholic patients. The experi- 
mental group will be limited to those 
alcoholic patients who may be designat- 
ed as symptomatic chronic alcoholics, 
all other types of alcoholism being ex- 
cluded. As stated above, the main char- 
acteristics of this group are desire for 
relief from addiction, an overlay of sur- 
face anxiety, conscious feelings of guilt 
concerning the excessive use of alcohol, 
and a general pattern of psychoneurotic 
madadjustment. 


PROCEDURE 


A group of seventeen male patients 
comprised the experimental group.” 
They were a group of successive volun- 
tary admissions to a state hospital and 
all were diagnosed by the psychiatric 
staff as symptomatic chronic alcohol- 
ism, without psychosis. The mean age 
of the group was 40.3 years with a 
range from 28 to 51 years. The mean 
educational level was 10.3 grades com- 
pleted with a range from grade six to 
three years of college. Six patients en- 
tered college, but none succeeded in 
completing college. Consideration of the 

2The data for this study were collected at 
the Clarinda State Hospital, Clarinda, Iowa, 


through the cooperation of Dr. Norman D. 
Render, Superintendent. 


educational level of the group suggests 
that the group probably falls above the 
average level of the general population. 
The entire series of twenty pictures 
of the original Thematic Apperception 
Test (TAT) of Morgan and Murray 
[27] was presented to all seventeen pa- 
tients by a single examiner in accord- 
ance with the conventional test instruc- 
tions. The TAT was selected because it 
appeared especially adapted for an 
analysis of personality content. Clinical 
experience with this technique has indi- 
cated that subjects tend to project their 
personal problems and conflicts into the 
phantasy productions. Specific studies 
of validity have been reported by Mas- 
serman and Balken [22], Rotter [31], 
Harrison [12, 13, 14], and the literature 
has been reviewed by Sargent[33] in a 
discussion of the validity of projective 
personality test techniques in general. 
The thematic productions were ana- 
lyzed qualitatively and quantitatively 
by means of a standardized scoring sys- 
tem developed by Rotter and Rodnick 
[32] and Rodnick and Klebanoff [29]. 
A group of thirty-six themas which 
have been found to appear with signifi- 
cant frequency in TAT content was se- 
lected, and these were grouped into five 
general categories including themas of 
(a) loss of life; (b) physical aggres- 


TABLE I 
CLASSIFICATION OF THEMAS 


PHYSICAL 
AGGRESSION 

Physical Violence 
Injury-Accident 
Illness 
Threat-Reprimand 
Crime 
Punishment 

Arrest-Capture 


LOSS OF 
LIFE 
Murder 
Death 
Suicide 


Domination 
Antagonism 
Dishonesty 
Disloyalty 
Quarrels 
Hate-Revenge 
Anger-Rage 
Deprivation 


NON-PHYSICAL 
AGGRESSION 


EMOTIONAL 

STRESS 
Mental Conflict 
Emotional Stress 
Misfortune 
Disappointment 
Depression 
Intoxication 
Jealousy 
Fear 
Separation & 
Rejection 


MISCELLANEOUS 
THEMAS 

Repents 

Forgiveness 

Love-Gratitude & 
Friendship 

Happiness 

Affection 

Goals 

Filial Obligation 
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sion; (c) non-physical aggression; (d) 
internal emotional stress; and (e) a 
group of miscellaneous items. The lat- 
ter group consisted of more neutral and 
pleasant themas. The themas selected 
for tabulation and analysis are shown 
in Table I. 

The frequency of occurrence of each 
of the themas listed in Table I was com- 
puted. In addition, the percentage of 
themas for each of the five general cate- 
gories was calculated for all patients and 
individual profiles were constructed. 

A second type of analysis involved 
consideratious of the type of successes 
and failures of the characters in the 
phantasy productions of the patients. It 
was felt that such expressions would be 
related to personal feelings of inferior- 
ity or superiority. Those themas deal- 
ing with successes and failures were 
classified arbitrarily into four categories 
including (a) economic, (b) social, (c) 
power, and (d) love. Such a categoriza- 
tion of themas makes possible an eval- 
uation of the specific kinds of insecur- 
ities in the personalities to be studied. 

Finally, it seemed important to con- 
sider the characters to whom successes 
or failures were attributed in the stor- 
ies. One would anticipate a tendency to- 
ward greater self-identification with 
central characters rather than with 
minor characters in the phantasy pro- 
duction. Rotter [31] has indicated the 
frequency with which autobiographical 
material which is of personal signifi- 
cance is attached to the central charac- 
ters in the stories. The tabulation of 
successes and failures was done sepa- 
rately for central and minor characters. 
Personality content was treated in a 
qualitative analysis of the frequency of 
occurrence of the various themas. This 
evaluation was made at a relatively su- 
perficial or descriptive level without re- 
course to deeper psychoanalytic explan- 
atory concepts or interpretations. 
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RESULTS 


The essential findings of the group 
analysis were summarized in Table J] 


TABLE II 
MEAN FREQUENCY OF OCCURRENCE OF THEMATic 
CATEGORIES WITH STANDARD DEVIATIONs, 
COMPARED WITH RESULTS UPON A 
NORMAL GROUP 


PERCEN TAGE 
MAJOR OF 
CATEGORIES TOTAL THEMAS §S.D. 


NORMAL 
GROUP* 
Physical Aggression 21 9.8 
Non-Physical 

Aggression 17 
Internal 

Emotional Stress 48 
Miscellaneous Themas 14 


SUCCESSES AND FAILURES 
Failure: 

Central Character 
Success: 

Central Character 4.1 
Failure: 

Minor Characters ‘ 4.1 
Success: 

Minor Characters 2 12.5 


AREAS OF FAILURE 

Economic Failure 6.2 
Social Failure 3 9.1 
Power Failure 8.9 
Love Failure 


* Data from Rodnick and Klebanoff [29]. Data on 
areas of failure are not available for this normal group 
This gives the mean percentage of oc- 
currence of all the major thematic cate- 
gories. The individual records show a 
striking similarity to the group means 
as borne out by the standard deviations 
presented in Table II. The results ob- 
tained by Rodnick and Klebanoff [29] 
upon a “well-adjusted” normal group of 
male subjects are utilized for compari- 
son with the present alcoholic group. 

The test records of the alcoholic pa- 
tients reveal relatively little expression 
of aggressive tendencies. This is true 
for both physical and nonphysical the 
mas of aggression. Although physical 
aggression occurs slightly more fre 
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quently than nonphysical aggression, 
both together do not equal the number 
of themas depicting internalized emo- 
tional stress. More than 46 per cent of 
the total themas elicited deal with situa- 
tions expressive of affective stress and 
unhappiness. There is a relative absence 
of pleasant miscellaneous themas. 

Table II also presents the mean per- 
F centages of themas which deal with suc- 
cesses and failures for both central and 
minor characters. The degree of failure 
attributed to central characters consti- 
tutes one of the more striking findings 
of the study. Eleven of the patients 
show this predominance of failure of 
the central character. When the total 
number of themas is considered, those 
depicting central character failure out- 
number themas of internal emotional 
stress. Failure of minor characters and 
success of central characters tend to be 
negligible. Minor characters, however, 
do achieve some successes, which might 
be expected in view of the marked fail- 
ure of central characters. Table II ana- 
lyzes the failures of the central char- 
acters in terms of economic, social, pow- 
er, and love inferiority. Failure at the 
power level predominates in the group 
and accounts for nearly half of all the 
failures of central characters. Social 
failure also plays a significant role. Fail- 
ure of the central character in love and 
sexual situations includes but 15 per 
cent of the total themas of central char- 
acter failure, while economic failure 
plays a negligible role in the group. Of 
the 17 patients studied, 7 place greatest 
emphasis upon power failures, 6 are 
dominated by social failure, and the re- 
maining 4 cases reveal an equality of 
social and power failure. In all cases, 
power and social failures exceed eco- 
nomic and love failures. 

Table III analyzes the content of the 
phantasy productions and indicates the 
frequency with which each of the 36 


TABLE III 
FREQUENCY OF OCCURRENCES OF INDIVIDUAL 
THEMAS IN THE GROUP RECORD 


Loss Murder 
OF Death 
LIFE Suicide 


Physical Violence 
Injury - Accident 
Illness 

Threat - Reprimand 
Crime 

Punishment 

Arrest - Capture 
Escape 


PHYSICAL 
AGGRESSION 


Domination 
Antagonism 
Dishonesty 
Disloyalty 
Quarrels 
Hate - Revenge 
Anger - Rage 
Deprivation 
Mental Conflict 
Emotional Stress 
Misfortune 
Disappointment 
Depression 
Intoxication 
Jealousy 
Fear 
Anxiety 
Separation - Rejection 


NONPHYSICAL 
AGGRESSION 


INTERNAL 
EMOTIONAL 
STRESS 


Repents 
Forgiveness 
Love - Friendship 
MISCELLANEOUS Happiness 
THEMAS Affection 
Filial Obligation 
Goals 


Total: 


themas tabulated appeared in the test 
records. Of the total number of 576 
themas scored, 68 or 12 per cent involve 
alcoholic intoxication. Such themas of 
inebriety occur most frequently, fol- 
lowed closely by ideas of rejection or 
separation from a loved one such as 
wife, mother, or lover. The general pre- 
occupation with themas of emotional 
turmoil stands out and the most fre- 
quent indication of aggressive tenden- 
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cies appears in allusions to criminal be- 
havior. The domination thema is high, 
showing consistent domination of the 
central character by minor characters 
and this is compatible with the empha- 
sis placed upon central character fail- 
ure. The content varies widely among 
the individual patients with the excep- 
tion of domination of the central char- 
acter and intoxication themas which 
predominate in all of the cases studied. 
Stories which include instances of 
loss of life such as mirder, death, or 
suicide are notably rare among this 
group of patients. This is consistent 
with the general disregard for themas 
of aggressiveness. Death as a result of 
accident or illness is somewhat more 
frequent than other forms of loss of 
life, the patients preferring the more 
passive type of aggressive expression. 


DISCUSSION 


Despite the limited number of cases 
studied, certain of the results are so 
definite and consistent as to merit fur- 
ther consideration and interpretation. 
The general similarity of the individual 
TAT profiles to the composite profile of 
the group must be noted in view of the 
fact that many investigators have em- 
phasized the heterogeneous nature of 
personality structure in chronic alco- 
holic patients. In the present study, the 
experimental group was limited to a 
specifically defined type of chronic alco- 
holism which has been termed sympto- 
matic alcoholism. In this group, the 
test results reveal notable homogeneity 
with respect to personality structure as 
well as certain aspects of personality 
content. It is, however, the structural 
integration of the thematic categories 
which is particularly consistent within 
the group studied. 

The symptomatic chronic alcoholic 
patient seems to deviate from the so- 
called normal individual or well-adjust- 
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ed personality in the amount of his cop. 
scious or unconscious feelings of inade. 
quacy. The inadequacy and insecurity 
expressed involves primarily the power 
and social spheres. In this respect, the 
obtained results corroborate the quali. 
tative studies of Schilder [34], Wal 
[41], Curran [7], and Bales[2]. There 
appears to be a chronic fear of failure 
in social adjustment which is funda. 
mental to all the personalities studied. 

Similarly, the same mode of compen. 
sation is adopted to combat the severe 
power and social inferiority. The frus- 
tration which inevitably must accom- 
pany the strong feelings of inferiority 
does not provoke the degree of aggres- 
sive expression which one would expect. 
Indeed, when the degree of inferiority 
is evaluated in terms of its potential ca- 
pacity to generate frustration, the rela- 
tive absence of physical as well as non- 
physical forms of aggression becomes 
surprising. It is clear that the basic 
reactive direction is introversive in 
these patients with the frustration in- 
ternalized in the nature of severe inner 
emotional stress and conflict. These 
conclusions are compatible with those 
reported in more recent studies of 
chronic alcoholic patients with the Ror- 
schach test [3, 4, 11, 30]. That the un- 
derlying dependent needs and passivity 
of the symptomatic alcoholic patients 
inhibit any consistent outward direc- 
tion of hostilities is corroborated fur- 
ther by a predominance of themas of 
death by illness or accident, rather 
than by more violent forms of loss of 
life in the phantasy productions. In 
studying the relationships between the 
various categories employed, rank-order 
correlations were computed and a co- 
efficient of —.58 was obtained between 
physical aggression and internal emo- 
tional stress, with the experimental 
group exhibiting a definite tendency to 
internalize those motives which nor- 
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| mally give rise to more external or 
‘overt expression of aggression [29]. 
In discussing individual themas, we 
have mentioned the marked preoccupa- 
‘tion with intoxication and general alco- 
‘ holic indulgence. This is the most domi- 
' nant thema in each of the seventeen rec- 
ords analyzed. Perhaps this is not a 
surprising finding in a group of chronic 
alcoholic patients, but the frequency of 
/ themas of intoxication may be of prag- 
matic diagnostic value where adequate 
case histories are not available. Fur- 
ther, it would appear that the sympto- 
matic patient is extremely subjective 
and introspective concerning his mode 
of social adjustment, and resultant 
guilt feelings may well produce an in- 
ability to resist the projection of this 
type of content in the phantasy mate- 
rial. The surface preoccupation with 
alcoholism also suggests an awareness 
of the unsatisfactory nature of the so- 
cial compensation adopted, an aware- 
ness which is not often seen in other 
forms of neurotic reaction. A correla- 
tion of +.67 was found between the per- 
centage occurrence of themas of intoxi- 
cation and those of internal emotional 
stress. The interpretation may be of- 
fered that guilt feelings generated by 
excessive use of alcohol tend also to be- 
come internalized and as a result, affec- 
tive stress and conflict increase in se- 
verity, preoccupation with alcoholic 
content becomes greater and reflects 
more marked notions of guilt and mal- 
| adjustment. 

Further consideration of the thematic 
content indicates that ideas of rejection 
and domination of the central character 
by minor characters are common. This 
trend appears to be interwoven with the 
social and power inadequacies and the 
concomitant guilt feelings about the 
alcoholic mode of adjustment. A corre- 
lation of +.47 obtained between domi- 
nation of central characters by minor 
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characters and failures of central char- 
acters suggests a tendency for ideas of 
rejection to increase as general feelings 
of personal inadequacy become more 
marked. 

As the symptomatic chronic alcoholic 
patient strives to become a man among 
men, he must struggle simultaneously 
with his felt inferiorities and his pas- 


‘sive and introversive personality struc- 


ture which prohibits a primarily extra- 
tensive rapport with the environment. 
The transient adjustment afforded by 
excessive use of alcohol permits a 
pseudo-extroversive solution but at the 
same time the social censure involved 
serves only to heighten the internal 
stress, conflict, and guilt. The habitual 
use of alcohol by this group of patients 
constitutes a temporary tool in an es- 
sentially psychoneurotic reaction whose 
dynamic bases appear to lie in the insol- 
uble conflict between tremendous social 
and power inferiority on the one hand 
and an extremely passive and introver- 
sive personality pattern which is totally 
unsympathetic to the underlying psy- 
chological needs of the individual. 


SUMMARY 


The Thematic Apperception Test 
(TAT) was administered to a group of 
seventeen patients with symptomatic 
chronic alcoholism and the phantasy 
productions were analyzed quantitative- 
ly and qualitatively. The results indi- 
cate: 

1. A relative absence of aggressive 
tendencies. 

2. Marked emphasis upon themas de- 
picting internalized emotional stress 
with little concern for themas of a 
pleasant or neutral kind. 

3. A striking predominance of fail- 
ure attributed to central characters with 
consistent domination of central char- 
acters by minor characters. 


4. Power and_ social _ inferiorities 











118 JOURNAL OF CONSULTING PSYCHOLOGY 


characterize the content of the group as 
a whole. 

5. Intoxication, domination, and re- 
jection themas occur with highest fre- 


quency. 

An interpretation of the group find- 
ings in terms of the dynamic interaction 
of the various factors is presented. 
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service in 1942, the writers were 
assigned to one of the induction stations 
in Los Angeles. Their function was that 
of screening inductees for literacy and 
then testing all illiterates on officially 
sanctioned performance tests in order 


A: THE beginning of their Army 


to determine whether their general ca- 
pacity was adequate for Army purposes 
in spite of their illiteracy, or whether 
they should be rejected as failing to 
meet the minimum mental standards for 
induction. It sometimes happened, as 
Goldstein’s article [6] might lead one 
to suspect, that there was an apparently 
deliberate attempt on the part of occa- 
sional individuals to do less well on the 
literacy and performance tests than 
they were capable. As a check on sus- 
pected malingering, the Terman Vocab- 
ulary Test and the Information subtest 
of the Wechsler-Bellevue Intelligence 
Examination were used clinically in or- 
der to determine whether the individual 
in question might have greater general 
capacity than his literacy and perform- 
ance scores would lead one to suspect. 
Neither of these tests was officially 
sanctioned as were the others which 
were routinely administered, such as 
The Army Information Sheet, The Vis- 


ual Classification Test, The Wells’ Con- 
crete Directions or The Block Counting 
Test. These two unofficial tests were 
used only when it appeared that the offi- 
cially sanctioned ones had not given an 
adequate picture of the inductee — in 
other words, as a check on_ possible 
malingering. 

After some time it was felt that a 
more accurate adjunct test would be one 
which was standardized on the normal 
flow of the inductees themselves. Con- 
sequently, the writers put together 105 
items of general information and tried 
them out experimentally on 200 cases. 
After this preliminary tryout, the 105 
questions were reduced to 50 by process 
of item analysis. The 50-item test, 
called the AB Information Test after 
the initial letters in the surnames of the 
writers, was then standardized on 1,000 
inductees. The odd-even, stepped-up re- 
liability of the 50-point test was .94 + 
.003. Its correlation with the number of 
years of schooling claimed by the induc- 
tee was .64 + .02. Against the Army 
General Classification Test it correlated 
-78 += .02. When correlated with the 
Terman Vocabulary Test, the AB gave 
an r of .90 + .007. The relationship of 
the AB to annual income claimed by the 
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inductee was .45 + .02, in one study; 
97 + .04, in another. It thus appeared 
that the Information Test, like the Ter- 
man Vocabulary [4], had sufficient va- 
lidity and reliability to warrant its use 
as a supplementary clinical device. 

In mid-1943 the writers were assigned 
to special training units, i.e., units spe- 
cializing in military and literacy train- 
ing for those inductees classified by the 
Army as illiterate. In September, 1943, 
all the separate units were combined 
into one Special Training Center for the 
entire Ninth Service Command, which 
comprised the Western eighth of the 
United States. Other than the Army 
Wechsler Mental Ability Scale, Form B, 
no officially sanctioned Army tests ex- 
isted which could be used in the indi- 
vidual testing of these individuals whose 
average verbal intelligence quotient, as 
determined by the Wechsler, was about 
67. Tests were devised by the writers ur 
adapted from previously published ones 
(3, 5] to measure certain adjustment 
factors; the Terman Vocabulary Test 
was incorporated in the battery of indi- 
vidual tests; finally, the AB Informa- 
tion was also included as a supplement 
to the Army Wechsler and the Terman 
as measures of verbal aptitude. 


The AB was also translated into 
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Spanish, since over five per cent of the 
trainees received at the Center were 
non-English and Spanish speaking. As 
a measure of their general aptitude, 
these men were subjected to certain per- 
formance tests of the Army Wechsler 
[1, 2]; the translated AB was intended 
to be a measure, however faulty, of 
their verbal functioning, which is by no 
means perfectly correlated with the ca- 
pacities measured by performance tests. 
Just how effectively the AB in transla- 
tion measured the so-called verbal ca- 
pacities of the non-English, Spanish- 
speaking group was of course not known 
in the beginning, since it was recognized 
that meanings change on translation 
into another tongue and that norms 
adequate for one culture are not neces- 
sarily adequate for another. 


It seemed obvious after it had been 
used for a few weeks that the informa- 
tion test did an injustice to the bilin- 
gua!s in the Center since their mean 
scores appeared to be consistently lower 
than those for the native-born, White 
illiterates. It was then decided to de- 
termine the validity of the AB for bi- 
linguals and, in translation, for the non- 
English, Spanish-speaking group at 
some later date when sufficient data had 
collected in the inactive files of the Con- 


TABLE I 
RATIOS RESULTING FROM DIVIDING THE DIFFERENCE IN MEAN SCORES ON THE AB 
INFORMATION TEST FOR CERTAIN GROUPS BY THE 
STANDARD ERROR OF THAT DIFFERENCE 








Group 
White 

2. Colored 
Mexican* 
Mexican+ 
Filipino 
Chinese 
Indian 


Mean ‘5 
6.32 
5.36 


2069 
5.11 532 
5.06 . 1785 
4.52 . 91 
3.20 131 
3.01 320 


7492 


N- (2) 
2564 10.66 


(6) 
13.26 10.40 
3.16 7.20 
25 ‘ 5.46 
6.20 
2.87 


=. 2 


6.05 
1.25 





* Non-English, Spanish-speaking trainees, mainly Mexican but with a sprinkling of Central and South Americans. 


t Bilingual, speaking both Spanish and English. 
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sultants’ office. 

Table I presents data for seven of the 
racial and linguistic groups found in the 
Center. The paucity of general infor- 
mation among these illiterates is made 
immediately obvious by examination of 
their mean scores, which run from 6.32 
for the native-born White to 3.01 for 
the American Indian. These scores are 
in striking contrast with the mean score 
of 25 found for the total flow of men 
processed at the Los Angeles Induction 
Station. 

The order of magnitude of the mean 
scores of the several groups is practi- 
cally the same as that found on the Ter- 
man Vocabulary Test [4], the only dif- 
ference being the interchange of posi- 
tion of the Chinese and the Indian. The 
non-English, Spanish-speaking trainee, 
it will be noted, is superior to the four 
bilingual groups in mean score. He has 
a mean score, in fact, which is not sig- 


nificantly inferior to that of the Colored - 


group. One would infer from the size 
of the sigmas that this non-English 
group had a much greater spread in 
ability than the other groups, for their 
sigma is significantly larger than any of 
the other standard deviations. The great- 
er variability is to be explained by the 
two relatively dichotomous types of men 
included among the non-English, Span- 
ish-speaking group. Some of this larger 
group were born in the United States, 
had lived here all of their lives, had 
even attended English-speaking schools 
for a number of years; yet for reasons 
which may be ascribed in part to cul- 
ture (living and working with Spanish- 
speaking groups only, and that usually 
in isolated areas) and in part to a very 
low rate of learning capacity, they 
know scarcely any words in English at 
all. On the other hand, some of the non- 
English, Spanish-speaking trainees 
from E] Salvador or Mexico, were quite 
literate in their native tongue, had been 
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to school a number of years (a few evey 
attended college) and were non-Eng. 
lish merely because they had been in the 
United States only a comparatively 
short time, in some cases only four or 
five months. 

The Indian, with his sigma of 2.56, 
has less spread in information score 
than any of the other groups. All of the 
differences between the Indian and the 
other groups in this respect are signif- 
cant, barring the critical ratio of 2.74 
between his sigma and that of the Fili- 
pino’s; even this ratio indicates a prob- 
ability of a true difference in the order 
of one in 400. This smaller amount of 
dispersion in information scores for the 
Indian is surprising for his sigma on 
the Terman [4] is the largest of all. 
Since some of the items in the informa- 
tion test are those which are relatively 
more often learned in school than is 
true of the Terman, it may mean that 
the educational opportunity of the In- 
dian is minimal in comparison with the 
other groups. In any event he is sig- 
nificantly inferior in information score 
to all other groups, excepting possibly 
the Chinese. 

One further observation should be 
made about the means and sigmas in 
Table I. It is obvious purely from in- 
spection of their relative sizes that the 
distributions from which they are taken 
are markedly skewed; none of the dis- 
tributions approximate normality. In all 
cases the scores tend to pile up at the 
lower end of the distribution and slowly 
taper off toward the upper end. This 
distributional abnormality is due, of 
course, to the previously mentioned lack 
of “bottom” on the part of the informa- 
tion test. While the test dipped suffici- 
ently low for the whole range of intel- 
lect, it is still too difficult for those of 
such markedly restricted aptitude and 
educational training as were found in 
a Special Training Center. 
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Table II shows the relatively greater 
difficulty of the Terman vocabulary test 
(4], in comparison with the AB Infor- 
mation test, for the bilinguals. The 
mean scores of all the bilingual groups 
and of the Colored are interpreted in 


TABLE II 


THE DISTANCE, IN TERMS OF THE SIGMA OF THE 
WHITES, THAT THE MEANS OF THE OTHER 
Groups ARE BELOW THE MEAN OF THE 
WHITES ON THE TERMAN VOCABU- 

LARY TEST AND THE AB INFOR- 
MATION TEST 


Test Colored Mexican* Indian Chinese Filipino 


2.06 2.54 2.08 
1.04 .98 .56 


Termant .27 1.79 
AB 30 .39 


* These are, of course, the bilingual Mexicans. — 
+ For the Terman means and sigmas, cf. [4]. 


terms of the sigma distance their mean 
scores deviated from the White, the sig- 
ma used being that of the White. The 
Colored mean on the Terman is .27 sig- 
ma below the White mean; the distance 
below on the AB is .30, thus showing 
that the relative difficulty of the two 
tests is almost exactly the same for the 
two groups. In sigma terms, however, 
the bilinguals are from two to four and 
one-half times farther removed from 
the White norms on the Terman than 
they are on the AB information test. 
The greater difficulty of the vocabu- 
lary test for those who are bilingual is 
probably due to an artifact, for in a 
sense information is but a variant form 
of vocabulary. For instance, if one is 
asked, “How many inches are there in a 
foot?” he is asked what is generally 
called an information item. To make an 
analogous vocabulary item, one might 
rephrase it to, “What does the word 
‘inch’ mean?” A greater precision is re- 
quired to answer the second question, 
for if one knows the full meaning of the 
term ‘inch,’ he automatically knows one 
of its aspects, its relation to a larger 
unit, a foot, the so-called informational 
item. At the other end of the continuum 
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of verbal ability, one might ask, “What 
does ‘hagiography’ mean?” or “What is 
‘hagiography’?” Is this an item of in- 
formation or is it a vocabulary item? 
Depending upon one’s vantage point, it 
could well be either one, for similar 
items have been used in both types of 
tests. 

If information is a variant subtype 
subsumed under vocabulary, there must 
be some reason for the relatively great- 
er difficulty of the Terman vocabulary 
test in comparison with the AB infor- 
mation test in this study. It is the 
opinion of the authors, partly stemming 
from statistical evidence and partly 
from a feeling derived from having in- 
terviewed several thousand trainees in 
a Special Training Center, that the first 
ten words of the Terman are too close 
to the vernacular for a bilingual’s grasp 
unless he is fairly able or is well-edu- 
cated. Whether this tentative hypothe- 
sis accounts adequately for all or even 
most of the greater difficulty of the Ter- 
man is, of course, not known. 

Although the data in Table II show 
that the Terman Test was more marked 
in its discrimination against the bilin- 
gual than was true of the AB Informa- 
tion, it is nevertheless true that the lat- 
ter test is far from being valid for bi- 
linguals when mean score differences 
are taken into account. Only one of the 
bilingual groups, the Chinese, was sig- 
nificantly below White norms in terms 
of the percentage of its number which 
finally was graduated from the Train- 
ing Center. If literacy or the ability to 
attain literacy is used as a criterion, 
then it is obvious that the AB test, as 
was the Terman [4], is not valid for bi- 
linguals if their mean scores are com- 
pared with Whites. However, this find- 
ing does not prove that the test has no 
validity at all. If the AB can be shown 
to have predictive values for the bilin- 
gual when his score is interpreted in 
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terms of his own group, as was the case 
with the Terman and the Army Wech- 
sler [2], then validity can be shown in 
this instance to be a function of the type 
of norms employed. Before these other 
validating coefficients are presented, 
however, some data on the relationship 
of the Terman and the AB for restrict- 
ed type of intelligence found among illit- 
erates will be presented. 


It will be remembered that for the 
full range of ability Terman and AB 
correlated .90. Such an r would tend to 
indicate that the two tests come close 
to measuring the same thing. With the 
very narrow range of ability with which 
this paper deals, it should not be ex- 
pected that the 7’s for any one of the 
groups would even approximate this 7 
of .90. In Table III, it may be seen that 


TABLE III 


THE PEARSON PRODUCT-MOMENT CORRELATIONS 
BETWEEN THE TERMAN VOCABULARY AND 
THE AB INFORMATION TEST FOR 

THE VaRIOUS GRovI PS 


P. E., N 


031 100 
036 100 
.040 100 
055 73 
.049 100 
.049 100 


Filipino 


Chinese ............... 


the highest degree of relationship for 


any one group is for the Indian, .733, 
and the lowest degree of eon is 
for the Chinese, .522. 

The N’s involved in this table are -_ 
sufficiently large to make any of the dif- 
ferences in correlation statistically sig- 
nificant. It can be seen, however, that 
if the N’s were tripled or quadrupled 
the differences in correlational value for 
some of the groups would become sta- 
tistically significant, if their present 
values remained constant. 

The correspondence in scores between 
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the two test does not appear to arig 
from any cultural factor. Two of the 
bilingual groups, the Mexican and the 
Indian, have the highest degree of asso. 
ciation ; interestingly, however, another 
bilingual group, the Chinese, yields the 
lowest degree of association. The White 
and Colored groups, stemming from a 
common culture insofar as language js 
concerned, occupy an intermediate po 
sition, but even so the r’s for these two 
groups are .119 apart. 

A previous study of the Army Wech. 
sler [2] showed that in terms of dis. 
charge and graduation, the Colored and 
Indian groups were more predictable 
than other groups in the Center. Pre. 
dictability apparently has little to do 
with the association of the information 
and vocabulary tests, however, for the 
Indian and Colored come close to occu- 
pying opposing positions in Table III. 
When one considers the percentage of 
overlapping factors, obtained by squar- 
ing the coefficients involved, it is evi- 
dent that for the Indian the per cent of 
overlap in the two tests is approximate- 
ly twice what it is for the Chinese or 
for the Colored. Just how to interpret 
this greater degree of overlap is a prob- 
lem for which there is no ready answer. 
Certainly it cannot be a function of flu- 
ency in English, as measured by the 
Terman vocabulary test, for the two 
lowest groups on the Terman, the Chi- 
nese and the Indian, are at opposite 
poles in correlation value. 

Nevertheless, it does appear that 
there are real differences in the amount 
of overlap of information and vocabv- 
lary for the several groups, whatever 
the cause or causes of the differences 
may be. With the restricted range of 
intellect dealt with in a Special Train- 
ing Center, the size of the r’s in Table 
III is not unduly low even in compari- 
son with the correlation of .90 for the 
total range of testable intellect passing 
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TABLE IV 
THe VALIDITY OF THE AB INFORMATION TEST AS DETERMINED BY THREE MEASURES: 
THE CRITICAL RATIO, THE BISERIAL COEFFICIENT OF CORRELATION, 
AND THE TETRACHORIC COEFFICIENT OF CORRELATION 


~ Dispo- | 

sition 
Grad. 
Disc. 
Grad. 
Disc. 
Grad. 
Disc. 
Grad. 
Dise. 
Grad. 
Disc. 


Grad. 
Disc. 


Grad. 


Group 


Mean 


4.14 
1.12 


3.51 
1.25 


4.95 
1.67 


5.59 
3.17 


5.90 
3.28 


6.74 
4.65 


8.10 5. 


3.46 
1.80 


2.59 
1.41 


3.27 
1.25 


3.14 
2.15 


3.05 
1.97 


3.22 


Chinese... id 
Indian. 
Filipino....... 
Mexican* 

S Colored.........-.---- ; 
| enna 


Mexicant 85 


Critical r,,,  P-E. 


Sigma Ratio 


6.55 


9.66 


6.37 


18.13 





* Bilingual, speaking both Spanish and English. 


+ Non-English and Spanish speaking, largely Mexican, though including some Central and South Americans. 


through one of the Induction Stations 
in Los Angeles. 

Three types of intragroup indices of 
validity for the AB are shown in Table 
IV: (a) The critical ratio of the mean 
differences between those graduating 
from the Center and those discharged 
for failure to graduate, (b) the biserial 
coefficient of correlation between the 
graduates and discharges and (c) the 
tetrachoric coefficient of correlation be- 
tween the graduates and discharges. 
The two types of coefficients essentially 
duplicate the information already given 
by the critical ratio technique. Never- 
theless, they do indicate in more readily 
understood terms the degree of associa- 
tion found in the AB test with the cri- 
terion here employed. Both coefficients 
have their advantages. A probable er- 
ror can be computed for a biserial co- 
efficient while it is extremely difficult to 
do so for a tetrachoric one. On the other 


hand, the tetrachoric coefficient of cor- 
relation in the present study came clos- 
er than the biserial to showing the ac- 
tual predictive efficiency of the AB. 
One way of checking the validity of 
the two types of coefficients is to check 
their degree of correspondence with the 
formula for efficiency of prediction, [7], 
E =1—k. The tetrachoric r with the 
disposition of the Filipino is .73 for the 
AB, markedly higher than that of the 
biserial correlation of .538. The index 
of forecasting efficiency for the former 
ry is 31.7%; for the latter, 15.7%. Ac- 
tually, 52 out of 53 Filipinos who scored 
four or higher on the AB graduated 
from the Center; only one who scored 
in this range was discharged as inapt. 
Since there were only 91 Filipinos in 
this group altogether, it is clear that for 
52.8% of all the Filipinos one could ac- 
curately predict their type of disposi- 
tion with 98% efficiency. For this group 
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the tetrachoric r actually minimizes the 
forecasting efficiency of the AB and the 
biserial 7 markedly underestimates it. 
For the Indians of the study, the index 
of forecasting efficiency is 21.5% for the 
tetrachoric coefficient and 12.5% for the 
biserial r. In actual practice, 72 of the 
73 Indians who made scores of five or 
higher on the AB were graduated, while 
only one was discharged as inapt. There 
are 320 Indians in this group; hence, 
for 23.8% of them, the prediction would 
have been accurate slightly over 98% 
of the time. This percentage of predic- 
tive accuracy is in almost exact accord 
with the figures derived from the tetra- 
choric correlation by use of the index 
of forecasting efficiency. It would, 
therefore, appear that of the two types 
of coefficients the tetrachoric more 
nearly represents the validity of the AB 
than does the biserial. 

Regardless of the merits of the vali- 


dating indices, it is quite evident that 
the AB Information Test was quite ef- 
fective in predicting the type of dispo- 
sition of illiterate trainees, whether bi- 


lingual or non-English- and Spanish 
speaking, when intragroup norms were 
used in making that prediction. What 
is interesting is that the AB is more 
effective for the bilinguals and the non- 
English than it is for the native-born 
Whites of the study. Even the Colored, 
otherwise so predictable through indices 
of maladjustment [3] and the Army 
Wechsler [2], have generally a lower 
association value with the AB than do 
the bilinguals. It will also be noted that 
the non-English, Spanish-speaking 
group has an order of association with 
the AB in translation which is above the 
median value of all the groups. 

There seems to be evidence, then, for 
the belief that an information test is 
quite valid for bilinguals, but only if 
these groups are evaluated in terms of 
norms deriving from their own distri- 
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butions. There appears also to be equal. 
ly legitimate evidence for believing that 
a test does not necessarily lose in valid. 
ity when translated into a foreign lan- 
guage. One reason why this latter state. 
ment cannot be made in a more positive 
or generalized form is that the group 
from which the data are derived is not 
typical, necessarily, of the culture and 
language from which these particular 
non-English trainees came. One further 
restriction in interpreting the data is 
that the validating criterion for this 
type of trainee is his ability to learn to 
read, write and speak English: It might 
be surmised that one who knew more 
about the language and culture of the 
residents of the United States would 
for that reason per se tend to learn the 
English language with greater facility. 
While the foregoing statement may 
seem to be a reasonable one, the writers 
do not subscribe to it. If, to make a 
preposterous assumption, two groups of 
Americans, one of which knew some- 
thing of the Hairy Ainu in terms of 
general information and one of which 
did not, were parachuted into the habi- 
tat of this hirsute specimen, it is be- 
lieved that the former group would 
learn the language of the people with 
whom they were cast much more quick- 
ly than would those who previously 
had known nothing of the existence of 
their present hosts. The reason for this 
belief is that the former group would 
be brighter, more apt, or better educat- 
ed, or all of these, simply because their 
general information was broad enough 
to include the little-known Hairy Ainu. 
A like explanation, it is believed, ac- 
counts for the marked validity of the 
AB Information Test in translation. 
There can be, however, no assumption 
that with varying foreign populations 
or varying criteria a translated test 
would necessarily retain its validity. 
Next to be considered is the tetra- 
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TABLE V 
VALIDITY COEFFICIENTS AND CERTAIN OTHER DATA FOR INDIVIDUAL 


Indian 
NR Tr NR 


White Colored 
NRY r 


Item Item 
Diff§ Diff r 
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ITEMS IN THE AB INFORMATION TEST 


Mexicant 
rT NR r 


Filipino 


Item 
NR r VR 


42 82 .30 51 4 


5 32 53 33 Vote 

9 47 ¢ Yardstick 
8 35 Christmas 
6 44 5S Inch-Foot 


83 .20 64 


Opposite 
Ruby 

26 47 q Argentina 
19 .56 Empire 
Leningrad 
Fourth 
Cabin 
Telephone 
Crash 
Armistice 
Sh’ peare 
General 
Half-day 
Line 
r-Sawyer 
Eiffel 
C’fucius 
Cork 
Feet-Mile 
~ocore 
Dec’tion 
Poland 


Square 


9 
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* Only five to ten individuals gave the correct definition or only a similar number failed this particular item. A 
etrachoric fT based on such a small number of cases in the tail of the distribution is too unreliable to have anything 


t inspectional value. 


+ Bilincual, speaking both Spanish and English. 


t Non-English, Spanish-speaking trainees, mainly of Mexican 


§ Percentage of responses correct, based on standardizing population 


juction Station. 


Percentage of responses correct, based on 673 trainees (th 


{ “NR” 
73 only. 


stands for ‘““Number Right”; in each 


tnere are 


choric coefficient of correlation of each 
individual item of the AB Information 
Test with the dichotomy of graduation 
from the Special Training Center ver- 
sus discharge therefrom, for all of lin- 
guistic and racial groups considered in 
this paper. In addition to the coeffici- 
ents for the various groups, Table V 
presents some other data. Column ore 
gives the number of the individual itein 
as originally standardized. Column two 
shows the percentage of correct re- 
sponses to each item given by the stand- 
ardizing population of the Los Angeles 
Induction Station. The third column 
shows the percentage of correct re- 


instance there are 


nationality and ancestry. 


»f 1000 cases in the normal! flow of an In- 


is table) in this Special Training Center. 


100 cases, exeept for the Filipino, for whom 


sponses for each item for the group of 
illiterates of this study. The percent- 
ages in this third column somewhat un- 
derestimate the abilities of the total 
population of the Training Center since 
there are proportionately far too many 
of the low scoring groups — Filipinos, 
Chinese and Indian—represented in the 
table. The White and Colored groups 
were the “brightest” among the illiter- 
ate population. Just how relative 
“bright” is in this context may be gath- 
ered from inspecting the number of cor- 
rect responses for these two groups— 
the number of correct responses is here 
also a percentage—and comparing them 
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with the corresponding percentages in 
column two. 

Beginning with the eleventh item 
(and barring the sixteenth) there is a 
sharp drop in correct responses for the 
trainees as a whole. While they could 
give as many as one-fourth the number 
of correct responses as the standardiz- 
ing population on the tenth item, the 
proportion drops to one in approximate- 
ly thirteen for the eleventh item. The 
sixteenth item, which appears to violate 
the order of difficulty, is easily under- 
stood when the item is once stated, 
“How many stars does a full general 
wear?! Since the trainees have been in 
the Army from one to two weeks before 
they are tested, it is obvious why they 
do better on this question than on some 
of the others. 

The coefficients for the White group 
on the AB are not very good, as might 
be expected from the validity of the 
whole test for the same group. The 
highest r for any item is for item eight, 
.47. So few of the Whites missed any 
of the first four items that the questions 
possess no discriminating power at all. 
From six through ten, the discrimina- 
tion of thé items is only fair, though all 
the coefficients are, of course, positive 
and of some value. Excepting items fif- 
teen and sixteen, fewer than 10 per cent 
of the Whites answered any of the ques- 
tions correctly from the eleventh item 
onward. 


The most valid items in the AB for 
the Colored are the first four questions, 
the r’s ranging from .53 to .70. The 
most valid question, the fourth; is, “How 
many inches are there in a foot?” If 
the Colored trainee did not know the an- 
swer to this question, he stood slightly 
better than an even chance of discharge 


1 This item was standardized before Con- 
gress created the five-star “General of the 
Army.” The question refers to a generai with- 
out adjectival or prepositional modifiers. 
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for being inapt; if, however, he ap. 
swered it correctly, his chances of grad. 
uation were nearly ten in eleven. The 
third question, ‘““Whose birthday do we 
celebrate on Christmas?” was also quite 
valid for the Colored. About 57 per cent 
of those discharged as inapt missed this 
question while only some 15 per cent of 
those who graduated missed it. Except. 
ing the ninth question, all of the items 
from the sixth through the eleventh 
were rather valid for the Colored. Be. 
yond the eleventh item little was gained 
from the standpoint of item validity, 
though much may have been contributed 
so far as the validity of the total score 
was concerned. 

The Christmas item, number four, is 
the most valid for the American Indian. 
Fifty-two per cent of those who gradu- 
ated answered this question correctly; 
only 7 per cent of those who were dis- 
charged knew the answer to the ques- 
tion. Viewing the data in another way, 
one may say that 45 of the 46 Indians 
who answered this question in a correct 
fashion were graduated. The first four 
items in the AB are the only ones with 
much validity for the Indian, however, 
except for the one previously mentioned 
in another connection—the question re- 
lating to the number of stars a full gen- 
eral wears. Except for this item and 
one other, all the questions from the 
seventh on proved to be so difficult for 
the Indian that none of them had any 
real discrimination value. 

The item with the greatest validity 
for the bilingual trainee of Mexican an- 
cestry is the question, “Where is the 
Empire State Building located?’ Only 
one trainee who knew the answer re 
ceived a discharge. The Christmas ques- 
tion and another, “What is the opposite 
of south?” proved to have good discrimi- 
nation value for the Mexican, as did the 
geography question, “Where is Arge?- 
tina?” Beginning with item nine, how- 
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; ever, this item and those succeeding had 
little discrimination power, largely be- 


cause too few of the Mexicans answered 


} any of them correctly. 


There are three items of almost equal 
validity for the non-English, Spanish- 


) speaking trainee: They include the ques- 


tion relating to Christmas, the one 


' about the inches in a foot, and the 


tenth item, “Why do we celebrate the 
Fourth of July?” Two of these items 
relate definitely to North American cul- 


' ture, the inch-foot and the Fourth of 


July items, in contradistinction to the 
Mexican and Central American culture. 
Yet these very items are among the 
most valid, even in translation, for those 
trainees who were mainly brought up 
in another culture. The relative bright- 
ness of these trainees who are largely 
of foreign extraction may be gathered 
by the fact that two more of them an- 
swered the question concerning the 


Fourth of July correctly than did the 


Colored trainees. All of the items 
through the tenth possess some validity 
for the Spanish-speaking trainee; be- 
yond that point too few answered the 
items correctly for there to be any 
question of discrimination. It will also 
be noted that two of this non-English 
group answered the twenty-sixth and 
twenty-seventh items correctly — items 
which are beyond the range of the sam- 
ple of other groups presented in this 
table. 

The three best items for the Filipino 
are “What is a yardstick used for?” 
“What is the opposite of south?” and 
“Where is Argentina?” Except for the 
fourth item relating to inch-foot, all of 
the first ten of the AB possessed rela- 
tively good validity for the Filipino. Be- 
ginning with item eleven and beyond, 
validity for the individual items is lack- 
ing, owing primarily to the paucity of 
correct answers. 

For the Chinese, as with the Filipino, 
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the first ten items of the AB possess 
quite good validity, ranging from .24 
for the question, ‘““Where is Leningrad?” 
to .65 for the item, “What is the oppo- 
site of south?” Beyond the tenth ques- 
tion, the only item with good discrimi- 
nation is the sixteenth, concerning the 
number of stars a full general wears. 

One of the most striking observations 
which derive from a study of Table V 
is that the validity of items is not an 
all-or-none affair, equally good or poor 
for all groups, but is very greatly influ- 
enced by the group which is being test- 
ed. One item in the AB seems to negate 
this proposition somewhat: For in- 
stance, the question concerning Christ- 
mas is valid for all groups, except possi- 
bly for the Whites. The group?r’s for this 
item are White, .24; Filipino, .38; bi- 
lingual Mexican, .47; Chinese, .56; non- 
English, Spanish-speaking, .57 ; Colored, 
.60; Indian, .67. One might assume that 
the validity of this item is due to its 
being common to Christian culture, ex- 
cept that such an observation does not 
apply to the Chinese. Generally speak- 
ing, however, this generalization holds: 
The validity of an item is a function of 
the group being measured. 

It might be assumed that items too 
closely connected with school learning 
would be invalid for the trainee. Some 
of the items were too difficult, even the 
easier questions, for the illiterate sol- 
dier. For instance, few could answer 
correctly the question, obviously closely 
related to literacy, “What war is asso- 
ciated with Uncle Tom’s Cabin?” “Name 
one of Shakespeare’s plays” was equally 
difficult for the trainee. But before be- 
lieving too implicitly in such a generali- 
zation, one should note the lack of abil- 
ity on the part of the trainee to answer 
the question, ““When does Armistice Day 
come?” Also it should be noted that 
three simple questions of geography, 
questions seven, eight and nine, were 






























































































































































































































quite valid for the trainees in general, 
though they are almost as much of a 
school item as the twentieth question, 
“Where is the Eiffel Tower?” School- 
ing, per se, may be important as a hy- 
groscopic agent in the absorption of 
general information but it must not be 
forgotten that a reservoir also is neces- 
sary—and the name of the reservoir, if 
the figure of speech will be pardoned, 
is aptitude. 


SUMMARY AND CONCLUSIONS 


1. The AB Information Test was 
shown to have adequate validity and re- 
liability for the total range of intellect 
to be found in the norma! Induction 
Station flow. 

2. In the light of the percentage of 
graduates within a Special Training 
Center, the bilingual’s mean score on 
Information Test was shown to be sig- 
nificantly inferior to those obtained by 
the White and Colored trainees. Since 
the percentage of bilinguals who were 
graduated from the Center was not 
much different from the percentage of 
native-born Whites who graduated, it 
can be seen that in terms of mean 
scores the AB Information Test was in- 
valid. 

3. When bilinguals were evaluated 
in terms of norms from their own 
groups and the criterion was set as 
graduation from the Center or dis- 
charged therefrom, the AB was shown 
to be quite valid. Validity of associa- 
tion with the dichotomy of graduation 
versus discharge was demonstrated by 
three techniques: (1) the critica] ratios 
of the differences in mean scores for 
those discharged for inaptness and those 
graduated from the Center; (2) biserial 
and (3) tetrachoric coefficients of cor- 
relation. The individual item validities 
of the information test for each racial 
or linguistic group were also presented. 

4. The validity of the AB test for 
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the criterion mentioned in the preced. 
ing paragraph was actually greater fo, 
bilinguals and, in translation, for the 
non-English, Spanish - speaking group 
than it was for the native-born whites. 
whose validity coefficients were the low. 
est of all the groups studied. One prob 
able explanation for the lowered valid. 
ity among the Whites is that they rep. 
resented a narrower range of intellect 
among their group than was true of the 
bilinguals, owing to the differentia] 
rates of illiteracy in the Ninth Service 
Command. The non-English, Spanish. 
speaking group was definitely superior 
to the Whites in terms of schooling and 
in intelligence quotient as determined 


by performance subtests of the Army 
Wechsler. 


- 


5. The validity of the AB Informa- 
tion Test for the non-English, Spanish- 
speaking trainees was somewhat sur- 
prising since it was necessarily given 
in translation and was partly based 
upon a culture which was foreign to 
most of this particular group. Varying 
the criterion, the range of aptitude of 
those tested, or the type of foreigner 
might vitiate the validity of the AB, it 
is pointed out. But if anyone asserts, 
without experimental evidence, that a 
test valid for one culture is, a priori, 
not valid for another, he must bear the 
burden of proof, especially if the cri- 
terion for validation is the same for 
both cultures. 

6. When the item validities for each 
group in this study are compared, it be- 
comes evident that there is little evi- 
dence for belief in questions of univer- 
sal validity, or, one should say, of ap- 
proximately equivalent validity for all 
groups. Only one item, “Whose birth- 
day do we celebrate on Christmas?” 
appeared to have a high though cer- 
tainly not uniform validity for all 
groups. 

7. The distributions of all the illiter- 
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ate groups were marked by skewness. 
The basis of the skewness found in the 
AB almost certainly lies in its difficulty 
for the low level of intelligence of those 
soldiers sent to Special Training Cen- 
ters during the late war. 

8. The data educed in this study, 
that for the Terman Vocabulary [4], 
and that for the Army Wechsler [2] 
show that there is a strong need for.a 
general aptitude test standardized on 
the dullest of the “normal” groups, the 
borderline feebleminded and the defi- 
nitely feebleminded groups. Aptitude 
in this instance would have to be an 
elastic term which would include ad- 
justment factors [3, 5] since the prob- 
lem of diagnosis of feeblemindedness in 
individuals who have I.Q.’s between 50 
and 80 is, in the opinion of the writers, 
primarily contingent upon the degree of 
neurotic or psychopathic trends that are 
manifested. 

9. The AB Information Test is rela- 


tively less difficult for bilinguals than is 
the Terman Vocabulary Test, when the 
decrement between the White and bi- 
lingual score is measured in terms of 
White norms. 


10. Evidence from an earlier paper 
showed that bilinguals suffer the least 
decrement on an arithmetic test [2]. In 
the light of this finding and of the one 
noted in the preceding paragraph, it is 
suggested that if it is desired to com- 
pare the scores of bilinguals on individ- 
ually administered verbal tests with 
those of Whites, the bilingual would suf- 
fer less injustice if arithmetic and in- 
formation tests were employed. It would 
be preferable, of course, to compare 
them, if comparisons must be made, on 
performance tests. 

11. For the full range of ability, 
Terman and AB correlated .90. For the 
illiterate groups, the r’s are naturally 
smaller, owing to the narrowing of the 
range of testable ability. The Indian 


group showed the greatest concomi- 
tance of vocabulary and information 
test scores, but no hypothesis considered 
seemed adequate to account for their 
order of association. 

12. Owing to the type of item found 
in current information and vocabulary 
tests and to the extremely high r be- 
tween Terman and AB (.90), the the 
ory that information is simply a sub- 
type of vocabulary is proposed. 

13. The illiterates in this study did 
somewhat less well, i.e., knew fewer of 
the items, on questions which are con- 
ventionally supposed to derive primar- 
ily from schooling, but their inaptness 
on this type of item was but little great- 
er than for items which are not essen- 
tially school items. If a trainee an- 
swered the question, “How many inches 
are there in a foot?” with “Depends on 
how long yo’ foot is,” the writers be- 
lieve that lack of aptitude is as much 
at fault as lack of schooling. 


14. As have two preceding studies 
[1, 2], this one also shows that the bi- 
lingual’s ability to acquire literacy in 
English is greater than the native-born 
White’s, if their scores on vocabulary, 
information or Army Wechsler verbal 
subtests scores are equated. Although 
the populations in these three studies 
are not representative of their respec- 
tive groups because of the restricted 
range of intellect, there is at least some 
evidence for suspecting that the mark- 
edly depressed intelligence quotients re- 
ported in years past for the Indian and 
the Mexican, to mention only two of the 
bilingual groups, mask an order of 
learning which is higher than would be 
anticipated in Whites of a comparable 
intelligence quotient. Further research 
on this moot point is certainly indicated, 
for it would be of invaluable assistance 
to those who give educational or voca- 
tional counseling to bilinguals on the 
basis of aptitude test scores. 
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THE RELATIVE INCIDENCE OF PSYCHONEUROSIS 
AMONG NEGROES' 


By WILLIAM A. HUNT 


NORTHWESTERN UNIVERSITY 


HILE serving as clinical psy- 

chologist with the psychiatric 
unit of a large naval installation during 
the recent war, the author had the op- 
portunity of observing a large number 
of both negro and white sailors who 
were being held for neuropsychia- 
tric observation. The Whites represent- 
ed the typical sampling of naval recruits 
during the war. The Negroes were a 
relatively underprivileged, culturally 
handicapped group largely from south- 
ern areas and had been assigned to duty 
as Stewards’ Mates. One received the 
impression from observing both groups 
that the psychoneuroses, while occur- 
ring among the Negroes, were relative- 
ly less frequent in the negro group than 
in the white group. The psychological 
conflicts of the negro group seemed of 
a more simple, elementary nature re- 
sulting in the less complex type of 
symptomatology typical of sociopathic 
behavior, emotional instability, inade- 
quate personality, simple maladjust- 
ment, and temperamental unsuitability. 
The tortuous, intricately structured 
mechanisms typical of psychoneurosis 
seemed to be less common. 

Since that time, an opportunity has 
arisen of investigating this hypothesis 
through an analysis of the relative in- 
cidence in white and negro groups of 
psychoneurosis as a reason for dis- 


‘ This study was facilitated by a grant from 
the Social Science Research Council of North- 
western University. 


charge from the naval service. Several 
samples of both negro and white medi- 
cal surveys were analyzed. The first 
sampling represented a group of negro 
and white cases who were observed by 
the psychiatric unit of which the author 
was a member. In each case the diagno- 
sis was agreed upon by more than one 
psychiatrist, usually after consultation 
with a psychologist. An unselected 
group of 105 consecutive negro dis- 
charges for neuropsychiatric reasons 
were used as the experimental group. 
A control sampling of 105 white dis- 
charges was obtained by selecting the 
next white case following each negro 
case in the files. Since it was impossible 
to obtain any exact measure of the size 
of the total population from which these 
cases arose, the results are given, not 
in terms of the percentage of a normal 
population that developed psychoneuro- 
sis, but in terms of the relative inci- 
dence of the diagnosis of psychoneuro- 
sis among a neuropsychiatric popula- 
tion. This is satisfactory as our pri- 
mary interest is not in the absolute 
amount of maladjustment among Ne- 
groes and Whites, but in the relative 
frequency with which this maladjust- 
ment expresses itself in psychoneurosis 
rather than some other psychopathosis. 
Psychoneurosis as used here is a gen- 
eric term including the neuroses. 
Among the 105 negro neuropsychia- 
tric discharges only 7, or 6.7 per cent, 
were diagnosed as psychoneurotic. Of 
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the 105 white neuropsychiatric dis- 
charges 23, or 21.9 per cent, were diag- 
nosed as psychoneurotic. In this neuro- 
psychiatric population, psychoneurosis 
accounted for fewer of the negro dis- 
charges than of the white. 

In order to test the generality of 
these findings, five further samplings 
from widely varying naval installations 
were analyzed. Four of these samplings 
were taken as above; in the fifth in- 
stance the procedure was varied by tak- 
ing as a control group all the white dis- 
charges during the period covered by 
the negro discharges. The results are 
presented in Table I. Keeping in mind 


TABLE I 
INCIDENCE OF PSYCHONEUROSIS AMONG 
NEUROPSYCHIATRIC DISCHARGES 
N = number of sample 
Pn = number psychoneurotic 


Negro 


N Pn %& N Pn 


6 60 100 «10 


100 «6338.0 100 7 
104 14 13.5 104. 6 
68 4 63 63 21 
499 0 00 517 48 


the possible low reliability of the psy- 
chiatric diagnosis as well as the varying 
administrative procedures under differ- 
ent military commands (both of which 
are probably reflected in the variability 
between our samplings), it seems best 
neither to submit the data to any sophis- 
ticated statistical treatment nor to be 
too positive in drawing any conclusions. 
The fact that all five samples show a 
greater incidence of psychoneurosis 
among the white group, however, agrees 
with our original findings. A further 
confirmation may be found from the 
data presented by Gardner and Aaron 
[1] in another context where sample 
naval neuropsychiatric populations of 
100 Whites and 100 Negroes were com- 
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pared. The white psychoneurotics out- 
numbered the negro psychoneurotics 20 
to 8. 

Comparison of these military findings 
with those for a civilian population js 
rendered difficult by the generally un- 
satisfactory nature of the statistics for 
psychoneurosis. The psychoneurotic 
usually escapes hospitalization in a state 
institution and hence avoids the most 
common source of our statistics on men- 
tal disease. Thus Malzberg [6], report- 
ing on the relative incidence of various 
diagnoses among the first admissions to 
New York State hospitals from 1928- 
31, finds psychoneuroses and neuroses 
accounting for only .2 per cent of all 
negro admissions as compared with 1.9 
per cent of white admissions, but the 
total number of negro cases is only 3. 
Wechsler [10] quotes a statistical study 
from the Vanderbilt Clinic which shows 
an incidence of 27.5 per cent neurosis 
among Negroes and 55.5 per cent among 
Whites. Schilder’s [8] figures show the 
same trend on a Bellevue Hospital sam- 
ple. Lemkau, Tietze, and Cooper [4, 5], 
in their studies of the Eastern Health 
District of Baltimore, confirm the rela- 
tively greater incidence for Whites of 
both psychoneurosis and “neurotic 
traits.” This difference holds true for 
all sub-groups except hysteria, where 
the negro rate is almost four times the 
white rate. 

This reversal of the trend in hysteria 
is also found in some of our naval data 
where it is possible to isolate this diag- 
nostic sub-category. However, it has 
been my experience with many such 
eases of hysteria among Negroes (and 
culturally handicapped Whites as well), 
that the behavior does not resemble the 
classical picture of hysteria, but rather 
the behavior of an extremely suggest- 
ible, uncritical, emotionally unstable in- 
dividual. The somatization is crude, 
largely dictated by environmental cir- 
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cumstances, and bears little or no logi- 
cal or symbolic relationship to the indi- 
yvidual’s conflict. Under these circum- 
stances, a diagnosis of emotional insta- 
bility seems more fitting than one of true 
hysteria or conversion neurosis. The 
admission of such a point would natu- 
rally favor our basic premise, as the in- 
clusion of hysteria as a subcategory 


under psychoneurosis apparently acts - 


to narrow the difference in relative in- 
cidence between Negro and White. 

There is little negative evidence 
against the premise that the psychoneu- 
roses are relatively less common among 
Negroes. Heyman [2] complains of the 
lack of recognition of neurotic behavior 
among Negroes but presents no statis- 
ties, and much of her illustrative ma- 
terial suggests a possible confusion be- 
tween emotional instability and “neu- 
rotic” behavior. Wagner [9] presents 
some findings from the Cincinnati Gen- 
eral Hospital in which the relative in- 
cidence is about equal, but the number 
of cases is small. Hyde and Chisholm 
[3] give the rejection rates at the Bos- 
ton Induction Center during the last 
war. The percentage of Negroes re- 
jected exceeds that of any other “na- 
tionality” or “racial” group reported 
and the psychoneurotic rate per: total 
population examined is greater for Ne- 
groes than for Whites. When one con- 
verts their figures into relative fre- 
quency of psychoneurosis as a per cent 
of total psychiatric discharges, however, 
the relative incidence of psychoneurosis 
is less for the negro group. We had best 
treat such induction center statistics 
with care [11] since the diagnoses are 
based on a brief examination, often by 
relatively unskilled examiners, with lit- 
tle or no opportunity for subsequent ob- 
servation and confirmation of diagnosis 
before rejection. 

Keeping in mind the variability in 
neuropsychiatric diagnostic practice and 
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the unsatisfactory nature of available 
statistics, there still seems evidence for 
the relatively less frequent occurrence 
of psychoneurosis among negro mental 
disorders. My clinical impression is that 
this is a cultural phenomenon, as I have 
also observed the same trend among 
culturally handicapped whites, a find- 
ing which has been confrmed by Per- 
kins [7]. The cultural connection also 
is confirmed by Wechsler’s findings 
[10] where at least a partial control of 
the cultural factor was possible. He 
found that the incidence of psychoneu- 
rosis among a group of better educated, 
more intelligent, and socially superior 
British West Indian Negroes was great- 
er than among a less privileged Ameri- 
can group. 

Since the indication is not that the 
incidence of mental disorder in general 
increases with increasing cultural level, 
but rather that one type (psychoneuro- 
sis) of abnormal behavior increases, it 
suggests that cultural level is not a pri- 
mary determinant of psychological con- 
flict but rather a determinant of the 
form which the expression of conflict 
will take. It may well be that the cul- 
turally backward individual is permitted 
an immediate, primitive, uncritical, 
emotional expression that is denied the 
more highly cultured individuals sur- 
rounded as he is by limitless social ta- 
boos and personal inhibitions. With in- 
creasing cultural level the individual is 
deprived of the more immediate and 
primitive types of emotional expression 
with the resulting necessity for his con- 
flicts to express themselves through the 
devious and complicated mechanisms of 
the psychoneuroses. 

The final explanation, however, must 
await more evidence than is currently 
available. At present, we must limit our 
conclusions to the statement that psy- 
choneurosis seems relatively less among 
the Negro mental disorders than among 
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the White, and that the phenomenon is 
possibly culturally determined. 


trict. Fourth paper. Ment. Hyg., N. y 
1943, 27, 279-295. 
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FEAR AND COURAGE IN AERIAL COMBAT 


By LAURANCE F. SHAFFER 


TEACHERS COLLEGE, COLUMBIA UNIVERSITY 


ILITARY aviation in wartime 

gave an unparalleled opportu- 
nity to study the problem of fear as ex- 
perienced by normal young men who 
had faced the imminent danger of 
death. During the later phase of World 
War II, all Air Forces fliers who had 
completed tours of combat were re- 
turned to the United States through the 
AAF Redistribution Stations, where 
they arrived from four to eight weeks 
after leaving an active theater of war, 
with combat memories still fresh. This 
study reports an interrogation of 4,504 
such fliers who had just returned from 
combat. 

An anonymous questionnaire of 159 
multiple-choice items was used, devel- 
oped by a series of drafts, trials, and 
revisions.? A preliminary version of the 
interrogation was administered to 205 
fliers, of whom 34 were afterward in- 
terviewed at length. The interviews led 
to the discovery of any items that were 
not well understood, that were ambigu- 
ous, or that could not be answered can- 
didly. A few additional areas of inves- 
tigation were also suggested. The pre- 
liminary study resulted in a question- 
naire that could be used with consider- 


5 ‘This research was conducted as part of 
the Army Air Forces Aviation Psychology 
Program. At the time of the study, the author 
served as Chief, Psychological Division, Office 
of the Surgeon, Hq., AAF Personnel Distri- 
bution Command. For reports of related pro- 
jects, see Wickert [2]. 

?Neal E. Miller made substantial contribu- 
tions to the drafting of items. John Dollard 
supplied many helpful suggestions from his 
study of fear in American veterans of the 
Spanish Civil War [1]. 


able confidence that the fliers could com- 
prehend it, and could give revealing and 
truthful answers. 

The first 23 items were background 
questions concerned with age, rank, 
duty, and facts about the combat tour 
of the examinee. The remaining 136 
questions dealt with the frequency, 
symptoms, causes and effects of fear, 
and with factors aiding or hindering 
its control. Responses were made by 
writing numbers or letters on a separate 
answer sheet, arranged for convenience 
in transferring the answers to three 80- 
column cards for machine tabulation. 
Examinees were encouraged also to 
write remarks, comments and illustra- 
tions fully, on the back of the answer 
sheet. 


Three typical items follow. 


During combat missions did you have: 


(25) “Butterflies” in the stomach? 


1 often; 2 sometimes; 3 never. 


(59) On the whole, did you become more 
afraid or less afraid, the more combat mis- 
sions you flew? 
1 Became more afraid as I flew more 
missions. 
2 became less afraid as I flew more 
missions. 
3 There was no difference. 
(118) Shooting at an enemy plane: 
1 increases fear. 
2 decreases fear. 
8 has no effect. 
4 don’t know; no experience. 


The Fliers Examined. The interro- 
gation was administered to all fliers with 
combat experience who passed through 
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the three Army Air Forces Redistribu- 
tion Stations’ from May 15, 1944 to 
July 22, 1944. The group consisted of 
1,985 flying officers and 2,519 flying en- 
- listed men. 

Of the officers, 45 per cent were bomb- 
er pilots, 16 per cent fighter pilots, 20 
per cent navigators, 16 per cent bombar- 
diers, and 3 per cent bombardier-navi- 
gators. Captains and First Lieutenants 
comprised 86 per cent of the group. 
Fifty per cent were married, and the 
median age was 24 years. 

The enlisted examinees were almost 
all aerial gunners; 83 per cent of them 
fought in B-17 or B-24 aircraft. Ninety- 
five per cent were Technical Sergeants 
or Staff Sergeants. The median age was 
24 years, and 33 per cent were married. 

All theaters of combat were repre- 
sented. Thirty-nine per cent of the of- 
ficers and 53 per cent of the enlisted 
men were from England; 33 per cent of 
the officers and 26 per cent of the men 
were from the Mediterranean; and 28 
and 21 per cent, respectively, from the 
Pacific. The median examinee had been 
overseas for 10 months and had flown 
35 combat missions. 


There was striking evidence of the 
intensity of the combat experiences of 
these men. That half or more of their 
friends had been killed or were missing 
was reported by 38 per cent of them. 
Only two and one-half per cent stated 
that none of their close friends were 
killed or missing. Sixty per cent had 
comrades killed or wounded in the same 
plane. Thirty-five per cent of them had 
lost ten or more pounds of body weight 
while in the combat zone; about one- 
fifth had been wounded in action, and 
an equal number injured in connection 
with aircraft. They had fought when 
opposition was most intense, and when 


$ Atlantic City, N. J.; Miami Beach, Fla.; 
Santa Monica, Calif. 
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combat missions were extremely hazari. 
ous undertakings. 


FEAR IN COMBAT 


Almost All Fliers Were Afraid, 4 
Table I shows, 99 per cent of these air. 


TABLE I 
THE FREQUENCY OF FEAR IN Compart 
“How many times did you feel afraid whip 
flying on combat missions?” 


Per Centof Per Cento 


Officers Gunners 


Never ; 1 l 
Only once ..... 3 3 
Two or three times . 15 12 
About 1/4 of the time .. 19 13 
About 1/2 of the time .. 17 15 
About 3/4 of the time .. 12 14 
Almost every time . 22 26 
Every time 11 16 
100% 100% 


men said that they had been afraid 
Thirty-three per cent of the officers and 
42 per cent of the gunners were afraid 


TABLE II 
THE STRENGTH OF FEAR IN COMBAT 


“How did the fears you experienced in combat 
compare with any other fears you have had 
in your life?” 


Per Cent of Percentof 


The fears felt in 


combat were: Officers Gunners 


Very much stronger 

than any other 

fear I ever had.... 40 
Somewhat stronger 

than any other 


About the same as 
the strongest fear 
I ever felt in an- 
other situation .... 

Weaker than some 
other fears 

Never experienced 
any fear in com- 
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} on every mission or almost every mis- 


sion. Eighty-five per cent of the officers 
and 83 per cent of the men stated that 
they were afraid on their first mission. 

Table II shows the reported strength 
of fear in combat, compared to fears 
experienced previously. It will be noted 
that from 58 to 68 per cent said that 


| combat fear is very much, or somewhat, 
' stronger than any earlier experience. 


The enlisted personnel reported slight- 


ly more frequent and more intense fear 


than the officers, a finding that is con- 
sistent throughout the data. This may 
possibly be due to the facts that the of- 
ficers were more rigorously selected, 
had longer periods of training, and 
usually carried more responsible, var- 
ied, and interesting duties. These hy- 
potheses are related to some of the fac- 


TABLE III 
SIGNS OF FEAR DURING AERIAL COMBAT 


“During combat missions did you feel:” 


“Per Cent Reporting 
“Often” or 
“Often” “Sometimes” 
A pounding heart and rapid 
pulse ..... 30 86 
Feeling that your muscles are 
very tense 30 83 
Being easily irritated, angry 
or “sore” 22 80 


30 80 
“Nervous perspiration” 

“eold sweat” 26 79 
“Butterflies” in the stomach 23 76 
Feeling of unreality, that 

this couldn’t be happening 

to you 69 
Having to urinate (pass 

water) very frequently 5 65 
Trembling 64 
Feeling confused or “rattled” 53 
Feeling weak or faint 41 
Right after a mission, not be- 

ing able to remember de- 

tails of what happened : 39 
Feeling sick to the stomach 38 
Not being able to concentrate 35 
Wetting or soiling your pants 5 





tors that were found to reduce fear. 
Another possible hypothesis arises from 
an observation made in other studies 
that fighter pilots, who were always of- 
ficers, had superior morale. This re- 
search did not obtain separate results 
for fighter and bomber pilots. 

How It Felt To Be Afraid. The re- 
ported symptoms of fear in combat are 
shown in Table III. The physiological 
signs of emotion predominated, includ- 
ing rapid heart beat, dryness of the 
mouth, sweating, stomach sensations, 
and frequent urination. These findings 
are of especial interest because the ob- 
servers were not psychologists, and it 
is probable that only a small proportion 
were acquainted with the traditional 
“right answers.” The more severe phy- 
siological effects of nausea and of invol- 
untary urination or defecation occurred, 
but only infrequently. Muscular evi- 
dences included tension and trembling. 
More distinctively psychological effects 
of high frequency were irritability, 
feelings of unreality and inability to 
concentrate. 

The delayed effects of fear reactions, 
felt before, after or between combat 
missions, are reported in Table IV. 
These are of special interest, because 
the most frequently reported ones are 
the same as the symptoms of anxiety re- 
action to combat, or so-called “battle 
fatigue.” Almost all fliers reported fa- 
tigue, restlessness, depression, and over- 
reaction to loud sounds. Irritability, 
loss of appetite, and loss of zest for fly- 
ing also appeared frequently. More se- 
vere symptoms, including obsessive 
thoughts, bad dreams, and anxiety at- 
tacks were reported by more than one- 
half of the group. These results suggest 
that almost all fliers had at least a par- 
tial or temporary case of anxiety state 
as a reaction to combat. Fortunately, 
most of them recovered promptly when 
removed from the threatening situation, 
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TABLE IV 
DELAYED EFFECTS OF FEAR, BETWEEN MISSIONS 


“Before or after missions did you have:” 
Per Cent Reporting 
“Often” or 


“Often” “Sometimes’ 


Feeling all tired out............... 39 92 

Feeling restless, not being 
able to sit still 3 89 

Feeling generally “blue” and 
depressed . 

Being “jumpy” 
sudden noises Se RS 

Not getting along with peo- 
ple, getting angry or 
grouchy too easily 

Loss of appetite 

Not wanting to fly the next 
mission .... ee eee 

Being bothered by thoughts 
of combat coming up when 
you don’t want them to 

Bad dreams 

Having shaky hands or knees, 
or muscular twitches 

Suddenly feeling afraid with- 
out any special reason for 
1 ; ; 

Wanting company, can’t 
stand it to be alone 

Not wanting to be with peo- 
ple, wanting to be left 
alone 

Having to urinate (pass 
water) too frequently 

Feeling more afraid of other 
things, such as of riding in 
an automobile 

Hating the sight of an 
airplane 


at loud or 


although some continued to show these 
same symptoms after return to the 
United -States, and required further 
care in convalescent hospitals. 

When Fear Comes. Other questions 
were asked concerning the course or se- 
quence of fear during the combat tour. 
More fliers (56 per cent) feared the last 
missions than the early missions (15 per 
cent), or the middle ones (14 per cent). 
The remainder stated that it made no 
difference. 
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Fear tended to increase during the 
tour of combat duty. Most (48 per 
cent of officers, 57 per cent of men) said 
that they became more afraid as they 
flew more missions. A minority (29 per 
cent of officers, 20 per cent of gunners) 
stated that their fears decreased with 
experience. Neither trend was reported 
by 23 per cent of both officers and ep. 
listed personnel. The uncontrolled face. 
tor of variation in the difficulty of mis. 
sions probably affected these figures 
However, during the period that these 
fliers were in combat, losses and casvu- 
alties were decreasing rather than in- 
creasing, for the forces as a whole. 

On any one mission, the greatest 
number of officers, 48 per cent, felt most 
afraid when at or near the objective or 
target, but a substantial minority of 19 
per cent felt the greatest fear just be- 
fore taking off. The gunners did not 
show this bimodal tendency, but re- 
ported the strongest fear at the target 
where opposition was most intense, fol- 
lowed by a gradually developing sense 
of relief. 

What Is Feared. A significant change 
in the reported causes for fear took 
place between the first mission and the 
later ones, as Table V shows. The early 
fears were subjective ones, fears of fail- 
ure and of personal cowardice. Experi- 
ence seemed to bring self-confidence 
but also brought the real dangers of 


TABLE V 
REPORTED REASONS FOR FEAR, COMPARING 
FIRST MISSION WITH LATER MISSIONS 


Greatest fear was 


of being: 


On First On Later 
Mission Missions 


A failure hme . 40 14 
A coward a . & 7 
Killed ; 18 36 
Wounded 14 28 
Captured cele 6 15 


100% 
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death and injury more vividly to the 


; fliers’ attention. 


More fliers reported greater fear 


i when a mission was known well in ad- 

\ vance (43 per cent), than when they 
i were told of it suddenly (18 per cent). 
' This finding is in harmony with other 
| information about the effects of anxious 
} anticipation. 


Personal experience was most pro- 
vocative of fear, as a majority (59 per 


i cent) said that it was more fearful to 


go on a mission one’s self than to wait 
for the return of friends who had gone. 
An interestingly large minority, how- 


» ever, were so imbued with group mo- 
f rale that they found it equally fearful 
ito “sweat it out” awaiting their com- 


rades as to fly themselves (18 per cent), 


hor even more fear producing (23 per 
} cent). 


Answers to a number of other ques- 


| tions about the sources of fear can be 
} summarized, by 


general statements 
based on the agreement of sixty to sev- 
enty per cent of the group. Enemy ac- 
sause of fear 


airplane. More feared having their 
plane on fire than being slightly wound- 
ed, having an engine go out, or having 
the controls damaged. Of the types of 
enemy action, antiaircraft fire (flak) 
more often feared than enemy 
fighter planes, possibly because effective 
counteraction was rarely possible. 


was 


Fliers were no more afraid of wounds 
in one part of the body than another. 
Fifty per cent said that the location of 
the wound did not matter. Of specified 
wounds, those of the eyes, abdomen, 
brain and genitals were most feared, in 
that order. 


DETERMINERS OF FEAR OR COURAGE 

A large number of factors were enu- 
merated, which the men were asked to 
rate as to whether they increased fear, 


decreased fear, or had no effect. 
Helplessness and Hopelessness In- 
Fear. The circumstances that 
were reported to have increased fear to 
the greatest extent are shown in Table 
VI. Some pertinent generalizations can 


crease 


TABLE VI 
FACTORS REPORTED TO INCREASE FEAR 
IN COMBAT 


Per Cent 
Reporting 
Being fired on when you have no 
chance to shoot back 85 
Someone reporting an enemy plane 
that you can’t see sO) 
Seeing enemy tracers 74 
Having something go wrong with 
your oxygen equipment 
Feeling that you have been in so long 
that the law of averages is bound 
to catch up with you 
Not knowing how many missions you 
will have to fly before being sent 
home , 
Having to be idle or inactive while 
flying a combat mission 
3efore your first mission, hearing re- 
ports about the dangers and losses 
on missions 
Knowing that you have to achieve an 
objective “at all costs” 


be drawn from these data. Helplessness 
and hopelessness increased fear most 
often. Being attacked when one could 
not fight back or take any other effec- 
tive action, being idle when in danger, 
or being insecure of the future, seem to 
be elements that aggravated fear in 
combat. 

Factors That Courage. The 
conditions that reduced fear in combat, 
or helped to control it, are given in 
Table VII in the order of the per cent 
of fliers stating that they were of value. 
The effects of and 
were most conspicuous. ‘Having confi- 
dence in one’s equipment, crew, and 
leaders was the most frequent response. 

The second most notable condition for 


Assist 


confidence morale 
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courage was effective activity. Keeping 
busy, concentrating on the job at hand, 
taking evasive action, or making a good 
hit, reduced fear tensions by substitut- 
ing constructive efforts. The converse 
of this, that idleness and frustration in- 
creased fear, has already been noted. 

Social stimulation also reduced fear. 
Seeing others calm, talking on the inter- 
phone, joking and hearing others joke, 
were reassuring conditions. 


TABLE VII 
FACTORS REEPORTED TO DECREASE FEAR IN 
COMBAT, OR HELP TO CONTROL FEAR 


Per Cent 
Reporting 


Having confidence in your equipment 93 
Having confidence in your crew. 92 
Having confidence in the technical 
ability of your immediate superior 
or commanding officer 
Keeping sR all the time while i in 1 the 
air 
Sesing « or ‘hearing giter | men acting 
calmly in dangerous situations 
Concentrating on the job you have to 


Knowing that you will be sent back 
home after completing a definite 
number of missions.... 

Taking evasive action..... 

Having a Commanding Officer who 
does everything possible to look out 
for the food, shelter, and comforts 
of his outfit oi 69 

Shooting at an enemy plane..... oan 69 

Hearing others make wisecracks on 
the interphone ..... 66 

Cracking jokes on the interphone.... 66 

Making a good hit on the target 63 


The study also revealed a number of 


factors that neither increased nor de- 
creased fear ; they were reported to have 
no effect. Commendations or citations, 
flying pay, self-advancement, or hatred 
of the enemy, had no significant influ- 
ence on fear in combat. Having a knowl- 
edge of the broad social aims of the war, 
or a strong belief in the rightness of 
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the war aims, were also reported to be 
of little or no value in controlling fear. 

Talking About Fear. Some specific 
questions probed the issue of admitting 
one’s fears and talking about them, 
Talking about combat fears after ra 
turn to the United States made no dif. 
ference to most fliers (53 per cent), 
made many feel better (33 per cent), 
but made a few feel worse (14 per 
cent). 

Most fliers (68 per cent) agreed th: 
fears should not be discussed while fly 
ing on a mission. Between missions, a 
majority (59 per cent) believed that 
fears should be admitted and discussed, 
but 9 per cent of the officers and 18 per 
cent of the men said that they should 
not be talked about. The remainder 
claimed that it made no difference. 

Broader Motivations. A question was 
asked, “What are the three strongest 
reasons that carried you through com- 
bat missions ?” The complete list of alter- 


TABLE VIII 
BROADER ASPECTS OF MOTIVATION 
TO ENDURE COMBAT 


“What are the three strongest reasons that 
carried you through combat missions? 


Per Cent 
Reporting 


Becaus se it is my y duty to my country 
Not to let my outfit or crew down.. 
To complete my quota of missions or 
hours and return to the U. S.... 
To show that I did my part in the war 
To make the world a better place to 


For the love of flying.. ‘ 

So that my family and loved ones : will 
approve 

To get revenge for friends who have 
been killed 

So that I wouldn’t be considered a 
a 

Because I hate the enemy so much... 

To keep my flying pay.. 

To get commendations or citations. 

To be able to get a better job after 
ae 
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natives, and the results obtained, are 


og! ; ‘ 
icuggests that the fliers’ stereotypes rath- 


ven in Table VIII. Critical reflection 


er than their basic motivations may be 


Leflected in these answers, but the re- 


sults are given as they were obtained. 
In general terms, a sense of duty, or- 
ganizational morale, hope of survival, 
and personal pride in accomplishment, 
seem to be indicated by the responses. 
It is not surprising to note that the nar- 
rower selfish motivations were of low 


ivalue, including pay, citations, and self- 
fadvancement. 


Hatred, also had little 


significance to this group. 


IMPLICATIONS CONCERNING FEAR 
AND ANXIETY 


A number of implications of general 
psychological interest may be drawn 
from the study of reactions to aerial 
combat. In no other situation have data 
on intense fear been available from so 
large a number of normal persons. 

The data support an hypothesis that 
the adequate stimulus for fear is a high- 
ly motivated situation toward which 
the individual has no adequate means of 
adjustment. Situations that permitted 
no adjustive response, such as “being 
fired upon when you have no chance to 
shoot back” were the most frequently 
reported causes of increased fear. On 
the other hand, engaging in effective ac- 
tivity was a very frequently indicated 
element reducing fear. Even activities 
that merely kept a man busy, although 
they were not very effective in avoiding 
the real danger, tended to decrease fear. 

The inventory of immediate symp- 
toms show that fear is a state of pro- 
found bodily changes, with the most fre- 
quently noticed effects in the circula- 
tory, muscular, glandular, digestive, and 
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execretory systems, in that order. The 
distinctively psychological symptoms of 
irritability, confusion, feeling of un- 
reality, and post-traumatic amnesia 
were also illustrated. 

The reports of the after-effects of 
fear are of special interest when com- 
pared to the usual signs of neurotic 
anxiety. Following the intense emotion- 
al experience of combat, the presence of 
fatigue, restlessness, and depression of 
emotional tone were reported almost 
universally. Overreaction to sounds, loss 
of appetite, fearful dreams, obsessive 
thoughts, tremors, and tics were also 
frequent. These are exactly the symp- 
toms found in the psychoneurotic syn- 
dromes of anxiety state and “nervous- 
ness.” 

The Air Forces men were not chroni- 
cally neurotic, however. To the con- 
trary, they were almost surely an above- 
average selection in this respect. The 
data indicate that normal men showed 
these symptoms after biologically ade- 
quate stimulation to excessive emotional 
responses; this is fear. Psychoneurotic 
persons make these same reactions to 
symbolic dangers arising from the in- 
adequate solution of conflicts, and from 
the results of unfortunate learning sit- 
uations; this is anxiety, This study, in 
showing the causes of anxiety symptoms 
in normal men, throws some light on 
the relationship between anxiety and 
fear, and on the role of emotion in the 
psychoneuroses. 
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NOTE ON THE AGE PLACEMENT OF 
YEAR-SCALE TESTS 


3Y EDGAR A. DOLL 


THE TRAINING SCHOOL, VINELAND, NEW JERSEY 


ARRETT’S observations on McNe- 
GS mar’s argument on the age place- 
ment of year-scale tests [6], followed 
by Jaspen’s further comment [7], tak- 
en in relation to the significance of year 
scales of mental measurement, raise is- 
sues too long unresolved in this field. 
This note does not propose to accom- 
plish such a resolution but only to add 
further observations on the problem, 
the crucial significance of which has not 
yet been realistically acknowledged. 
This is aside from theoretical objections 
to year scales as such, a separate prob- 
lem which at the moment this note by- 
passes. 

It has long been evident to anyone 
critically interested in Binet’s own work 
on the Binet-Simon measuring scale for 
intelligence that Binet and Simon were 
not ingenuous in their consideration of 
this detail. Rather the unthoughtful 
critics seem not to have perceived the 
real problem. It is unfortunate that Bi- 
net and Simon did not present their ar- 
guments in favor of the procedure they 
adopted of a variable per cent of passes 
on individual tests of their scale and on 
the related problem of the divisions of 
the scale by unequal year groups and 
unequal numbers of tests per group. 
The present writer concerned himself 
with this question while assisting God- 
dard and Kite in checking and editing 
the translations of the articles published 
by them [1, 2]. Three variables were 
noted at that time, namely: (a) the 
year-scale intervals, (b) the number of 


tests per interval, and (c) the per cent 
of passes per test per interval. We were 
also concerned with a related problem, 
namely, the equivalent status of tests 
passed above the basal year as compared 
with tests failed below the basal year. 

It was noted as to (a) that the year 
intervals were of unequal spread, with 
the year intervals below III vaguely 
noted and with a two-year interval for 
X - XII, a three-year interval for XII- 
XV, and an undistributed interval for 
adults (1911 scale). 

It was noted as to (b) that there were 
six tests in all for the three year-inter- 
vals below Year III (0 - I, I - II, and II- 
III), with eight tests at Year VII, and 
three tests for Year XIII. One needs to 
recall that the 1905 scale was a hier- 
archical series of thirty tests not ar- 
ranged by year groups which included 
the six tests for below Year III; that the 
1908 scale consisted of fifty-eight tests 
unequally distributed from Year III to 
Year XIII by annual intervals with 
from three to eight tests per interval; 
and that in the 1911 revision the scale 
was revised from Year VI forward now 
with unequal intervals but an equal 
number of tests per interval. 

It was noted as to (c) that a variable 
percentage of passes was employed, 
roughly described as “a majority”, ac- 
tually ranging from about fifty to per- 
haps ninety per cent of “passes” per 
test item, but centering around sixty- 
five to seventy-five per cent. 

In deliberating these difficulties as far 
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pack as 1912 with the innocent purpose 
of resolving them in a revision and ex- 
tension of the 1911 scale (a project sub- 
sequently dropped because of Terman’s 
contemporary work), we formulated the 
yroblem somewhat as follows: 

~ 1. Mental development follows a gen- 
erally logarithmic course from birth to 
maturity, that is, mental growth is rela- 


‘tively rapid early in infancy and pro-: 


ceeds at a diminishing rate as age pro- 
gresses. We subsequently [4] modified 
this to suggest an early initial lag dur- 
ing the first few months of life. 

2, As mental growth progresses it 
“fans out” from a few measurable as- 
pects to many, that is, the expression of 
intelligence increases in number and va- 
riety of aspects or aptitudes as age ad- 
vances. 

3. Mental growth becomes increas- 


‘ingly variable with the advancing years 
Fin respect to its expressive aspects with 
§ individual differences increasingly 


marked. 
4. These different aspects of mental 


' growth are variously intercorrelated. 


5. Consequent to (1), the significant- 


} ly measurable intervals of growth be- 
come farther apart as age advances, 
| that it, it is much easier to distinguish 
| between a child of three months and 
} one of six months than it is between a 
» child of ten years three months and ten 
} years six months. Or the distinguish- 
; able differences between three months 


and six months may be roughly compar- 
able to the distinguishable difference 
between ten years and eleven years (or 
sume other amount). Thus, in the first 
year of life there might be as many as 
four distinguishable intervals, whereas 
in later infancy the intervals might be 
twelve months, and as maturity ap- 
proaches two or three or perhaps five 
years, In other words, the growth curve 
is asymptotic to a maturity ceiling, and 


as this ceiling is approached the distinc- 
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tion between intervals becomes increas- 
ingly difficult for discriminative meas- 
urement. 

6. Consequent to (2), the measurable 
aspects of mental growth are few early 
in life and many at maturity so that the 
number of tests per interval would be 
few in the early fragmentation of the 
scale and many in the later intervals. 

7. Consequent to (3), the per cent of 
passes is relatively high in the early 
life age intervals and relatively low, but 
presumably not much below 50 per cent, 
at the maturity intervals. 

8. In sum, a developmental year scale 
would then contain (a) “narrow inter- 
vals”, few tests, and a high per cent of 
passes at the infant level, with (b) in- 
creasing interval spread, increasing 
number of tests and decreasing per cent 
of passes as age advances. This system 
would be affected by the intertest cor- 
relations which are presumably high at 
the infant level and relatively low at the 
maturity level, but generally high 
throughout in order to maintain an ap- 
proximation to a relatively unitary sys- 
tem of measurement. 

9. There remains the problem of com- 
pensating failure and success. In the 
interest of simplicity of summation, 
should a test passed well above the basal 
year compensate equally for a test failed 
only somewhat below the basal year? 
Should tests passed at progressively 
higher intervals above the basal year 
all be counted as of equal plus value? 
Obviously it would be desirable to intro- 
duce some differential scoring weight 
for tests passed or failed according to 
their position on the scale as a whole in 
relation to the basal year. Or should 
one in the interest of simplicity and 
practicability hope rather wishfully for 
compensating balances on the theory 
that the likelihood or value of passing 
odd tests above or below the basal year 
is more or less self-equating? 
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10. In the summation of item-test 
scores into a total score, the per cent of 
passes for any given combination of 
tests would obviously be somewhat be- 
tween the per cent of passes for the 
“easiest” and “hardest” tests; that is, in 
a year scale the per cent of passes for 
year placement for a given test would 
be higher than that for all the tests in 
combination at a given year; or, where- 
as the standard percentage of passes for 
test items would vary from 50 per cent 
of passes to 100 per cent for a given 
year interval, the total per cent of 
passes for a given “mental age” would 
be 50 per cent (that is median or mean 
passing). 

11. But these last problems are fur- 
ther complicated by the unevenness of 
the ogives represented by the per cents 
of passes for a given test at successive 
age intervals. These ogives frequently 
intersect for successive test items so 
that one needs to take into account the 
per cent of passes for the total test 
ogive from zero per cent to 100 per 
cent, rather than by any single point or 
any fragment of the curve. It is quite 
unusual for tests to standardize in ex- 
actly parallel ogives or with comparable 
slopes. Consequently the inequalities of 
the ogive forms must be taken into ac- 
count and the total curve rather than 
some fragment of it should be employed. 
Very few year scales, not even the Ter- 
man-Merrill, have satisfied this require- 
ment since the tails of the ogives are 
frequently missing by not revealing 
complete zero or complete 100 per cent 
passes. 

12. This introduces the problem of 
item calibration and scaling and the de- 
termination of the equality of progres- 
sive units represented by tests of pro- 
gressively increasing difficulty. This 
problem is resolved (as has been done 
in constructing the Vineland Social Ma- 
turity Scale) by standardizing the indi- 
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vidual tests in terms of means and sig 
mas and then restandardizing the tot, 
scores by year intervals [3]. In such, 
scale the inequalities of the progresgiys 
item units are to a degree neutralizg 
by equalities in the summated unit 
But this in turn is influenced by the jp. 
tertest correlations which introdyp 
further complications where errors 4 
sampling and intricacies of statistic! 
manipulations are considerations of yy 
small moment. 

One might add other disturbing vari. 
ables such as the unequal reliability oj 
test items, and other attributes of item 
standardization such as the relation o 
item scores to sex and social-economir 
status. It would seem that the cut-and. 
fit method will have to continue in oper. 
ation until these difficulties are resolved, 
but in the meantime we can reduce the 
amount of “manipulative” standardiza- 
tion by increasing use of logical and sta- 
tistical warrant. 

Meanwhile care should be taken not 
to confuse hypothesis with conclusion. 
If our tests are standardized on an as- 
sumption of a normal curve of distriby- 
tion, we should not be surprised to dis- 
cover that the distribution is normal. In 
other words the problem of circularity 
of reasoning should be kept clearly in 
mind. Likewise if the tests have been so 
standardized as to produce a smooth 
progression of average units, we should 
not confuse the corollary of average cot- 
stancy of development with a “dis- 
covery” of constancy. And a fortiori we 
should not confuse average constancy 
with individual constancy. Nor should 
we deny the possibility of “nascent pe 
riods” in individual development when 
such a possibility has been neutralized 
in the determining principle of average 
standardization. Nor should we be sur- 
prised if there are no sex differences i 
the avoidance of sex differences has beet 
a principle in the test selection. And 
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likewise for other variables. 

The point of this note is to suggest 
that until these questions are harmoni- 
bously resolved there will continue to be 
both futile and misleading work in this 
area of mental measurement. The prob- 
lem of scatter, for example, is obviously 
related to the problem of item calibra- 
Ftion both as to the means and the varia- 
bilities of the item standardization. The 
‘same is true for sex differences, envi- 
ronmental differences, diagnostic signif- 
icance of test scores, and so on. In spite 
of the excellence of McNemar’s presen- 
tation of data for the Terman-Merrill 
scale [8], many of these critical issues 
have been either ignored or their signif- 


F icance dogmatically denied. 


It would seem that in spite of the high 
practical excellence of some year scales 
we are still a long way from achieving 
scientific or logical and statistical satis- 
faction. Some but not all of these issues 
have been faced frankly but by no means 


| satisfactorily resolved in the construc- 


tion of the Vineland Social Maturity 
Scale, the evidence for which is still in 
process of preparation for publication. 
Similar issues must be faced in stand- 
ardizing the Oseretsky scale for motor 
aptitude [5]. It is evident that this is a 


much more serious problem for mental 
measurement than appears to have been 
realized. Are the difficulties too serious 
for complete resolution in our present 
state of knowledge? 
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THE SHIPLEY-HARTFORD RETREAT SCALE AS A 
QUICK MEASURE OF MENTAL STATUS 


By SOL. L. GARFIELD 


VETERANS ADMINISTRATION HOSPITAL 
MENDOTA, WISCONSIN! 


HE SHIPLEY-HARTFORD RE- 

TREAT Scale [1,3] has many fea- 
tures which suggest its use as a quick 
screening test of mental ability. It is 
easily administered to a group, readily 
scored, requires a short time for ad- 
ministration and has a comparatively 
high ceiling. The total scale consists of 
an Abstraction Scale and a Vocabulary 
Scale, each having a ten-minute time 
limit. Separate scores are obtained from 
the individual scales, and the combined 
scores of both scales can be converted 
into a total mental age. A Conceptual 
Quotient is also obtainable to measure 
mental impairment, but it is the total 
mental age derived from the scale which 
is considered to be a measure of the 
mental level of the individual. 

In a recent study of neuropsychiatric 
patients in the Navy, Wright [2] con- 
cluded that the Shipley-Hartford Scale 
could be used for an easily determined 
approximation of the general intelli- 
gence level of the individual. In corre- 
lating total S-H scores obtained from 
the Wechsler-Bellevue Scale, he obtained 
a coefficient of correlation of + .77 be- 
tween the two tests. Although 32 per 
cent of his subjects obtained total S-H 
scores indicative of mental functioning 
at a borderline or mental defective level, 
Wright interpreted these findings as an 


1 Published with permission of the Chief 
Medical Director, Department of Medicine and 
Surgery, Veterans Administration, who as- 
sumes no responsibility for the opinions ex- 
pressed or conclusions drawn by the author. 


indication that his subjects were fune. 
tioning considerably below their origi. 
nal intelligence level (as measured by 
the vocabulary score of the S-H Scale). 
Shipley, in his manual for the scale, al. 
so states that if the Conceptual Quotient 
obtained on the scale is normal, the to- 
tal mental age is probably valid. Be 
cause of the particular nature of the 
scale, this statement of Shipley’s may 
imply that the effectiveness of the scale 
is not valid at the lower end of the in- 
telligence distribution.? Nevertheless, it 
appeared likely that the scale could be 
used as a rough measure of intelligence, 


PRELIMINARY INVESTIGATION 


The S-H Scale was administered by 
the writer to several hundred subjects 
in a Disciplinary Barracks, and it was 
noted that a comparatively large num- 
ber of them received rather low scores. 
Although exact figures are not avail- 
able, approximately one-third of the in- 
itial group tested secured mental ages 
below 12 years and required additional 
testing. After further study it was 
found by a rapid inspection technique 
that practically all men who tested 
above 11 years mental age on the S-H 
Scale secured an IQ of 80 or better on 
the verbal part of the Wechsler-Belle 

2The manual for the S-H Scale indicates 
that a “doubtful” Conceptual Quotient is ob 
tained when a subject secures a vocabulary 
score below 23. This would imply that prac- 
tically all individuals with total S-H mental 


ages below 13.7 years have “doubtful” Con- 
ceptual Quotients. 
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THE SHIPLEY-HARTFORD RETREAT SCALE 


vue Scale (Form 1). Therefore, it was 
decided that a mental age of 11 years 


Fon the S-H Scale could be used as a cut- 


off point in our screening procedures, 


land that all men who tested below that 


point should be retested on the verbal 
part of the Wechsler-Bellevue Scale. If 


Fan individual secured an IQ below 80 on 


the latter he would be given the per- 
formance part of that scale as well. 


THE PRESENT STUDY 


Following the preliminary investiga- 
tion, 350 men were tested on the S-H 
Seale using a total mental age of 11 
years aS a screening level. Out of the 
total group, 126 men fell below this lev- 
el and were retested on the W-B Scale. 
These 126 men constitute the group uti- 
lized in the present study for a compari- 
son of the S-H and W-B scales. The 
mean age of the group was 25 years 
with a range from 20 to 40 years. Their 
educational accomplishments ranged 
from no grades completed in school up 
through the twelfth grade. The mean 


TABLE I 


RELATIONSHIP OF SHIPLEY-HARTFORD MENTAL- 
AGE TO W®ECHSLER-BELLEVUE IQ 
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school grade completed was 6.34 grades 
with a standard deviation of 1.64. Un- 
fortunately, because of lack of time, on- 
ly the total mental age derived from the 
S-H Scale was utilized in the compari- 
son. 

The range of scores on both scales is 
presented in Table I. Because the group 
was a selected one and because the range 


'on the S-H Scale was very limited, it 


was deemed inappropriate to compute a 
coefficient of correlation between the 
two sets of test scores. The relationship 
can be seen readily in the table. Where- 
as the range on the S-H Scale was lim- 
ited, the IQ on the W-B Scale extended 
from 50 to 106. Out of the total group 
of 126 cases who secured mental ages 
below 11 years on the S-H Scale, 89 or 
70.6 per cent secured 1Q’s of 80 or bet- 
ter on the W-B Scale. Furthermore, of 
this latter group, 37 cases secured IQ’s 
of 90 or better. It should be noted also 
that of the 27 cases who were unable to 
obtain a mental age of at least 8.4 years, 
the lowest mental age obtainable on the 
S-H Scale, 12 were able to receive an 
1Q of 80 or above on the W-B Scale. Of 
the 37 cases who fell below an 1Q of 80 
on the complete W-B Scale, 18 were in 
the borderline category and 19 were 
judged mentally defective. It is thus 
found that of the 126 men who secured 
a mental age below 11 years on the S-H 
Scale only 29.4 per cent were below the 
dull normal level on the W-B Scale. 
From the data presented above, it is 
evident that scores of the S-H Scale are 
lower comparatively than scores ob- 
tained by the same group of 126 sub- 
jects on the W-B Scale. Mental age 
scores secured from the S-H Scale can- 
not be interpreted rigidly as definite in- 
dices of mental level, for a. majority of : 
subjects below 11 years mental age on 
3 With the exception of two other cases, 


these were the only subjects who received a 
complete W-B examination. 
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this scale definitely were above the bor- 
derline defective level as measured by 
the W-B Scale. While the group studied 
here, with a mean grade level of 6.34 
grades, was somewhat below average in 
terms of education, this is considered 
only a partial answer for their low scores 
on the S-H Scale. In Wright’s study the 
average educational level of the sub- 
jects was 10th grade; yet 32 per cent of 
his subjects were “functioning” at a 
borderline or mental defective level on 
the basis of total S-H mental age. This 
corresponds closely to the findings pre- 
sented in this study, for the 126 sub- 
jects reported here represent 36 per 
cent of a group of 350 tested. As men- 
tioned previously, Wright interprets the 
low scores of his subjects as represent- 
ing a functional loss of intelligence. 


This interpretation is questioned by the 
writer. A much simpler interpretation, 
and one which appears more valid, is 


that the S-H Scale, and the Abstraction 
Scale in particular, is exceedingly diffi- 
cult for the average and below average 
subject. 
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SUMMARY 

The results obtained from this study 
clearly indicate that the total ment, 
age obtained from the S-H Scale is low. 
er than would be secured from an indi. 
vidual scale such as the W-B. If ¢ 
former is to be used as a screening de. 
vice, scores at the lower end of the dis. 
tribution must be interpreted cautious. 
ly and a cut-off point must be derive 
experimentally for the particular grow 
tested. Certainly, the mental age secure4 
from the S-H Scale cannot be inter. 
preted as a valid mental age, particular. 
ly for individuals in the lower half of 
the intelligence distribution. 
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A NOTE ON THE USE OF THE WOODWORTH-MATHEWS 
QUESTIONNAIRE TO PREDICT THE “EMOTIONAL 
ADJUSTMENT” OF DELINQUENT BOYS 
IN A CORRECTIONAL SCHOOL 


By DALE B. HARRIS 


UNIVERSITY OF MINNESOTA 


URING the past twenty years the 
Mathews revision of the Wood- 
vorth psychoneurotic inventory [1] has 


heen widely used with juveniles. Some 


investigators have interpreted with cau- 

ion the quality or qualities tapped by 
this instrument. However, a high score, 
indicative of many atypical responses, is 
usually accepted as indicating an ab- 
normal emotional condition of neuroti- 
cism or emotional] instability. Occasion- 
ally, one meets the view that the instru- 
ment has some value for indicating be- 
havioral stability—the ability of a per- 
son to maintain a rather consistent emo- 
tional relationship to the problems of- 
fered by his environment. If the thesis 
is accepted that personality tests of the 
questionnaire sort ultimately rest back 
on clinical determinations, then the cru- 
cial trial of the predictive power of any 
test is to be found in the agreement of 
the preliminary test ratings with com- 
petent observer’s ratings, based on long 
and intimate acquaintance with the sub- 
jects’ behavior. A brief study, testing 
the inventory’s capacity to indicate un- 
stable behavior within an institution for 
delinquent boys, is here reported. 

The responses of 165 of 201 consecu- 
tive new commitments to the Minnesota 
State Training School were collected 
and examined. Seventeen cases with a 
reading ability below the fifth grade as 
measured by the New Stanford Read- 


ing examination were eliminated, along 
with 19 cases who omitted three or more 
items in the schedule. The mean age in 
months of the group studied was 189, 
and the mean I1.Q. was 90.3 (Kuhlmann 
Tests of Mental Ability). The mean 
score of the subjects on the inventory 
was 12.8 with a standard deviation of 
7.4 points. All cases outside plus and 
minus one standard deviation from the 
mean were selected as deviate groups. 
Nineteen individuals making a score of 
20 or above were designated as the un- 
stable group. Thirty-one cases falling 
below a score of five points were desig- 
nated as the stable deviates. The groups 
were thrown together and the individual 
cases arranged in order of their admis- 
sion to the Training School. 

The list of names thus prepared was 
presented to three staff members of the 
school who knew the boys very well. 
These three persons were asked to desig- 
nate the boys as stable or unstable with 
respect to their general adjustment in the 
school. Adjustment was defined in terms 
of ability to get along with a minimum 
of friction with other boys and of emo- 
tional “evenness” or “stability” at a 
level of expression not characterized by 
extreme mood, either constant depres- 
sion or constant elation. One of the ra- 
ters was the vocational director of the 
school and member of the disciplinary 
committee. Another was assistant to the 
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educational director at the time of the 
inception of the study and a teacher ac- 
tive with the boys in club and hobby 
work. The third, a teacher who adminis- 
tered the educational tests, had had ex- 
tensive psychological training and fol- 
lowed the boys’ educational adjustment 
with considerable professional interest, 
retesting them at the end of the year. 
Nothing was said to the raters con- 
cerning the nature of the study; none of 
them knew that the ratings were made 
with respect to Woodworth data. The 
Woodworth-Mathews scores themselves 
were unknown to these raters. The ra- 
tings were obtained after all the sub- 
jects had completed a year’s stay in the 
institution. The study suffers to some de- 
gree from the leveling effects of the ra- 
ters’ memories of the boys, since a num- 
ber of the subjects had been paroled a 


TABLE I 


AGREEMENT BETWEEN TEST AND RATER CLASSI- 
FICATIONS OF 31 “STABLE’”’ Boys (As DE- 
TERMINED BY THE WOODWORTH- 
MATHEWS INVENTORY) 


Rater Rater 
1 2 
Agreement with test classification 49% 
Disagreement with test 
classification 32% 


Rater uncertain 19% 


TABLE II 

AGREEMENT BETWEEN TEST CLASSIFICATION 
AND RATINGS OF 19 “UNSTABLE” Boys 
(AS DETERMINED BY THE WOODWORTH- 

MATHEWS INVENTORY) 


Raterl Rater2 Rater 3 


41% 37% 74% 


Agreement 
Disagreement 37% 
Rater uncertain 16% 


AT% 10% 


16% 16% 
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short time previous to the time of ra. 
ting. However, the ratings may be ap. 
cepted as valid characterizations by 
competent individuals who knew tho 
boys well. 

The raters were able to make clear. 
cut decisions in most cases. Each rater, 
however, returned a few names which 
he was unable to classify in either of 
the two categories suggested. An exami. 
nation of these names showed that with 
but four exceptions, the raters independ. 
ently allocated the same individuals ty 
this “uncertain category.” It is assume 
that these individuals were not emotion- 
al deviates, as far as the judges were 
concerned. 

Twenty-three per cent of boys whose 
test responses alone would indicate very 
good potential adjustment actually de- 
served “poor” ratings, according to com- 
bined estimate of observors. Thirty-one § 
per cent of “poor bets” received “good” 
ratings. In several instances the three 
raters agreed in disagreeing with the 
judgement of the test. 

If the raters’ estimates be accepted as 
valid, then the validity of the test itself 
for the purpose stated in this paper may 
be called into question. Discrepancies be- 
tween judgments of the test and judg- 
ments of school officials are substantial. 
Assuming that the school officials’ judg- 
ments were sound, “neurotic” responses 
to questionnaire items would not have 
predicted for the subjects examined, 
stable emotional behavior within the in- 
stitution with much greater than chance 
accuracy. 
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deem New Books and Tests 


- Which 
ther of 
eXxami- 
at with 
lepend- With this issue, the JOURNAL OF CONSULT- tistical methods to a cross-sectional descrip- 
uals to Hync PsycHOLOGY introduces a new policy of tion of personality, and to reduce the observa- 
sSumed Mepublishing short, prompt, reviews of books and_ tions to a minimum number of traits or fac- 
tests that are of interest to clinical and con- tors. Four principal sources of personality de- 
sulting psychologists. The interval between scription are utilized: clinical descriptions of 
publication of a book and its review in a pro- pathological syndromes, behavior ratings made 
fessional journal has often been a year or largely by the use of check lists of trait names, 
Whose Mmore. The long, critical, review serves a very self-inventory questionnaires, and objective 
te very [necessary purpose, but this is met adequately tests. The omission of projective methods as 
lly de- by other periodicals of the Association. There a source of data is perhaps surprising, but it 
| js also a need for shorter, informative, reviews it clearly the author’s choice and not his over- 
reenel o call new tools to the attention of professional sight. Of the Rorschach he says, “It remains a 
‘ty-one § psychologists. mixture of ill-defined intentions, analogous to 
“good” Reviews of newly published tests will be of a patent medicine, devoid of clear-cut theoreti- 
three MBparticular value, since this area is not covered cal bases that would permit refinement through 
th the mesystematically by any other psychological jour- research” (p. 463). As an end result of his 
* BBpal. By arrangement with its Editor, Psycho- factor and cluster analyses, Cattell gives a 
logical Abstracts will list and index all tests system of twelve primary source traits of per- 
ted as reviewed here, without description, but with a sonality, supported by all of the data consid- 
; itself BBcross reference to this JOURNAL. ered. The book contains two promises. A forth- 
ir may Many of the books and tests reviewed in this coming volume will describe personality from 
ies be- issue were published in 1946. In the future, a developmental or longitudinal point of view, 
judg- the reviews will appear more promptly. It is to complete the picture given by the cross-sec- 
; hoped that materials will be reviewed approxi- tional approach. A series of tests of the twelve 
antial. BPmately three months after their receipt, which primary personality factors has been prepared 
judg- is the shortest lag permitted by the printing and validated, both for individual and for 
,oNSes schedule. group testing, and soon will be published. 
- have Publishers of appropriate books and tests 
nined are invited to submit materials for review. 
ie te Readers will do the Editor a favor by calling , a g 
he in- Tibhis attention to books or tests that should be = “'°™ 1947. Pp. 86. 
hance fMfreviewed, and by expressing their opinions of Psychologists and educators who need a 
the new policy. clear and brief summary of the achievements 
: Editor. of the armed forces in the selection and classi- 
fication of personnel during World War II will 
Books find this monograph enlightening. Chapter I 
_— en ae is a remarkably full and concise account of 
ATTELL, LAYMOND B. Description and meas- the procedures used during the war, in rela- 
ure ment of personality. Yonkers-on-the-Hud- tively nontechnical language. Chapter II, “Im- 
son, N. Y.: World Book Co., 1946. Pp. 602. plications for Civilian Education,” suggests 
This is a volume of research findings, based the continuing applications of the techniques 
on data collected by the author, and on an and findings. Two technical appendices will 
analysis of some hundreds of other investiga- be of great interest to test construction spe- 
tons. Its purpose is to apply logical and sta-  cialists and statisticians. 
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Forp, MARY. The application of the Rorschach 
test to young children. Minneapolis, Minn.: 
Univ. of Minnesota Press, 1946. Pp. 114. 
The Rorschach test was administered to 123 

children, approximately 25 in each year group 

from age 3 through age 7. Means and stand- 
ard deviations of all Rorschach determinants 
are given for each age level, which may be of 
value in the interpretation of children’s rec- 
cords. Fifty-five of the children were retested 
after one month, and it was found that the 
reliabilities of the determinants ranged from 
+.38 to +.86. W scores did not correlate with 
mental age, but M, %F+, %0O+ and C-sum 

did show substantial correlations with intelli- 

gence in this group. Fair agreement was found 

between certain Rorschach patterns and ra- 
tings of introversion-extraversion, and of emo- 
tional adjustment. 


GARRETT, HENRY E. Statistics in psychology 
and education. (3rd Ed.) New York: Long- 
mans, Green, 1947. Pp. 465. 

In its third edition, Garrett’s Statistics re- 
mains the standard text for elementary in- 
struction because of its well considered selec- 
tion of content, and its clarity of presenta- 
tion. In recognition of the recent trend to- 
ward specific attention to the design of experi- 
ments, there is a new chapter on testing ex- 
perimental hypotheses, which includes an ex- 
panded treatment of the chi square technique, 
and an introduction to analysis of variance. 
The chapter on sampling and reliability has 
been revised extensively, with more emphasis 
on the statistics of small samples. Of interest 
to the statistics of test construction are pre- 
sentations of the Kuder-Richardson method of 
calculating reliability, and the Wherry-Doolit- 
tle multiple correlation technique for test se- 
lection. 


JERSILD, ARTHUR T., AND AssociATes. Child 
development and the curriculum. New York: 
Bureau of Publications, Teachers College, 
Columbia Univ., 1946. Pp. 236. 

The author and his associates have presented 
the fundamental facts of child development 
from infancy through adolescence as shown 
through changes in children’s overt behavior. 
Sound judgment has been shown in selection 
of areas of growth to be discussed and in re- 
search findings used to substantiate evidences 
of growth in relation to chronological age. The 
authors state frankly the gaps that exist be- 
tween our knowledge of children’s development 
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and knowledge needed for scientific revision , 
the school curriculum for promotion of opti. 
mum development of individual children. Chap. 
ter II is particularly outstanding for the pre. 
sentation made of basic principles underlying 
growth trends of children. As the title of th 
book indicates, the material presented is fy 
use by administrators, supervisors and teacher 
in service as criteria for evaluating propose 
revision of elementary and secondary schoo) 
curricula. 


KELLY, GEORGE A. (Ed.) New methods in ap 
plied psychology. College Park, Md.: Uniy 
of Maryland, 1947. Pp. 301. 

In November, 1945, the Military Division of 
the A. P. A. held a conference at the Uni. 
versity of Maryland on military contributions 
to applied psychology. The proceedings of the 
conference, published in this volume, will be 
significant source book for those who wish an 
over-all view of psychological accomplishments 
during the war, and who are seeking further 
applications of the methods that were devel- 
oped. The thirty-five papers cover a wide 
range of contributions, including research on 
selection, training, leadership, morale, equip- 
ment design, and clinical psychology. 


KLOPFER, BRUNO, AND DAVIDSON, HELEN H 
The Rorschach technique: 1946 supplement. 
Yonkers-on-Hudson, N. Y.: World Book Co., 
1946. Pp. 45. 

This paper-bound booklet is an appendix to 
Klopfer and Kelly’s The Rorschach Technique 
(1942), prepared to accompany the third 
printing of the basic book. The first part of 
the suppement is a brief review of new trends 
in Rorschach development since 1942. A sec- 
ond part is a chapter on a method of form-level 
rating. This is of particular interest because 
of Klopfer’s earlier rejection of Rorschach’s 
F-plus per cent as a score. His new proposal 
rates each response for form on a scale of 
from minus 2 to plus 2. A number of exam- 
ples of scoring are given. There is a supple 
mentary bibliography of 272 entries, from 1942 
through September, 1945, and a revised index 
to the complete volume, including the supple 
ment. 


Link, Henry C. The rediscovery of morals. 
New York: Dutton, 1947. Pp. 223. 


Dr. Link discovers what is wrong with the 
world today. 
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NEW BOOKS 


Clinical psychology. (Rev. ed.) 
Pp. 661. 


bouttit, C. M. 
New York: Harper, 1947. 
Louttit’s long-useful book undergone 
‘imal revision. Some new material has 
- added, as on the 1937 Stanford-Binet and 
» techniques of treatment. There are a few 
well considered deletions of dead wood. A num- 
her of sections have been rearranged and re- 
vised. Since the book’s field is defined as that 
of children’s behavior problems, the very brief 
mention of the Bellevue-Wechsler Scale can 
perhaps be excused. Less justified is the scant 
attention given to new and lively issues, such 
as projective methods, play techniques for 
diagnosis and treatment, and relationship ther- 
apy. The revision hardly seems to present a 
true picture of clinical psychology in 1947. 


has 


Mastow, PAUL. The analysis and control of 
human experiences. 16 Court St., Brooklyn, 
N. Y.: Author. Vol. I, 1946, Pp. 195; Vol. 
II, 1947, Pp. 229. (mineographed). 

These volumes represent an attempt to ex- 
plain all human characteristics and activities 
in terms of the Rorschach pattern. Problems 
f economics, war, fascism, politics and many 
other fields are subjected to a confused and 
usually inaccurate Rorschach appraisal. 


MAURER, KATHARINE M. Intellectual status at 
maturity as a criterion for selecting items 
n preschool tests. Minneapolis, Minn.: Univ. 
of Minnesota Press, 1946. Pp. 166. 


This study makes a direct attack upon a 
psychological problem of long standing: the 
low correlation of measures of intellect at pre- 
school ages with those obtained later. The 
Wells’ revision of the Army Alpha was given 
to 226 subjects, aged 16 to 22 years, who had 
previously been tested with the Minnesota Pre- 
school Seale before the age of 6 years. By item 
analysis, with the adult test as criterion, tests 
were distinguished whose predictive value held 
up through the years. Valid tests included 
digit span, word opposites, directions tests, 
picture puzzles and block designs. Invalid tests, 
In general, were those that made too heavy 
demands on language or on motor skills at 
early ages, or that were too easily affected by 
variable experience. The study provides valu- 
able source material for future test construc- 
tion. 


Sachs, WuLF. Black anger. Boston: Little, 
Brown, 1947. p. 324. 
A keen psychoanalytic account of the ex- 
periences and adjustments of a native African 
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medicine-man; a personal document of con- 
siderable significance in studying intercultural 
conflicts. 


SCHNELL, DorotHy M. Characteristics of ado- 
lescence. Minneapolis, Minn.: Burgess Pub- 
lishing Co., 1946. Pp. 68. 

An interesting tabular analysis of the traits 
of adolescents, condensed from 98 references 
to medical, psychological and educational liter- 
ature. 


The role of the teacher in per- 
(Rev. ed.)~ New York: Bureau 
of Publications, Teachers College, Columbia 

Univ., 1946. Pp. 497. 

This is the third edition of Dr. Strang’s in- 
fluential book for teachers and teacher-coun- 
selors. With its readable and nontechnical 
style, and its abundance of realistic illustra- 
tive material, this book will continue to stimu- 
late menta! hygiene activities and attitudes in 
the schools. It reaches the teacher who, with 
the parent, has a strategic place at the roots 
of mental health that can never be replaced by 
the professiona! psychologist. The new edition 
is enlarged and rewritten, with special em- 
phasis on practical methods. 


STRANG, RUTH. 


sonnel work. 


STRECKER, EpwArpD A. Their mother’s sons. 

New York: Lippincott, 1946. Pp. 220. 

The author develops a single theme in this 
book and has coined the word “Momism” to 
express the theme graphically. Differentia- 
tion is made between the emotional attachment 
of youth for their maternal parent as indicat- 
ed by the use of “mother” or “mom.” “Mom,” 
as distinguished from mother is, according to 
the author, “a convenient verbal hook upon 
which to hang an indictment of a woman who 
has failed in the elementary mother function 
of weaning her offspring emotionally as well 
as physically.” (p. 13). The author bases his 
indictment of “Moms” upon his psychiatric ex- 
perience with men in the armed forces who 
suffered from psychological breakdowns. 
Though the author presents his material in 
popular form, classes in sociology of the fam- 
ily and personality adjustment would profit 
by close examination of the basic hypotheses 
underlying Dr. Strecker’s indictment. 


WiTMER, HELEN L. (Ed.) Psychiatric inter- 
views with children. New York: Common- 
wealth Fund, 1946. Pp. 448. 

Psychologists who share the profession’s 
growing interest in psychotherapy will find 
this a most stimulating book. In it, ten illus- 
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trative records are presented in detail, each 
with a considerable amount of verbatim trans- 
scription of the interviews between the client 
and the therapist. The eight workers who con- 
tributed the case studies include two clinical 
psychologists (Phyllis Blanchard and Robert 
A. Young), one lay analyst and five child psy- 
chiatrists. Each therapist first states his con- 
ception of the client’s problems, and his aims 
and intentions for treatment. The reader is 
then able to watch the development of the 
therapy, in the interpersonal exchanges of the 
interviews. The details of technique differ from 
one case to another, determined in part by 
adaptation to the needs of the particular cli- 
ent, and in part by the theories and habits 
of the therapist. Broadly, all of the methods 
are variations of relationship therapy, in 
which the client feels secure in the therapist’s 
acceptance, and in which he can “be himself,” 
relatively free from the pressure to play a 
conventional social role. The general charac- 
teristics of the therapy are well summarized 
by Witmer in the introductory part and in a 
concluding chapter. The book will prove valu- 
able for the in-service training of counselors, 
and for the instruction of advanced students. 


TESTS 


California Test of Mental Maturity, by E. T. 
Sullivan, W. W. Clark, and E. W. Tiegs. 5 
levels: Grades Kgn-1, 1-3, 4-8, 7-10, 9-adult. 
1 form at each level. Untimed, (90) min. 
California Test Bureau, Los Angeles, Calif., 
1946. 

The chief features of this test are its analy- 
sis into language and nonlanguage abilities, 
and into five factors (memory, spatial rela- 
tionships, logical reasoning, numerical reason- 
ing, and vocabulary), on all of which useful 
scores can be obtained. It is a power test, 
without rigidly prescribed time limits. A pre- 
test detects children whose visual and auditory 
acuity, and motor coordination, might be so 
handicapped as to influence the results. Re- 
liabilities, in one-grade ranges, vary from .89 
to .97 for whole forms, and from .70 to .94 
for parts. Validity is chiefly inferred, but a 
correlation of .88 with the Stanford-Binet 
Test is stated, for an unspecified level and 
range. The 1946 revision gives extensive sug- 
gestions for the use of scores, and percentile 
norms for half-year or one-year age ranges 
as well as mental age and grade-level norms. 
The large amount of nonverbal material, even 
at the higher levels, is a distinctive and com- 
mendable feature of this test. 
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Clerical Perception Test, by G. B. 
Grades 9-12. 
tional Test 
1947. 

Test 1 consists of two number-checking parts 
of 100 items each, and Test 2 of. two nama 
checking parts also each of 100 items. Leyg 
of aspiration scores are obtained for Part p 
of each test, but these are not discussed jy 
the manual. The reliability of each test is 9 
Reference is made to studies on the validity 
of this type of test in vocational selection. An 
added value is noted, in the discovery of sty. 
dents subject to eye strain, who should receiys 
visual examinations. This test is part of the 
“Planning for Life” series of high 
guidance techniques. 


Saldwin 
1 form. 16 (22) min. Educa. 


Bureau, Minneapolis, Minn. 


School 


Gregory Academic Interest Inventory, by W 
S. Gregory. College students and entrants, } 
form. Untimed, (60) min. Sheridan Supply 

Co., Beverly Hills, Calif., 1946. 

The empirical method for the constructior 
of an interest blank was applied to academic 
interests by comparing the responses of jun- 
iors and seniors in specified major courses 
study, to those of freshmen. The 28 keys, for 
a wide variety of academic majors (agricul- 
ture, architecture, biological sciences, business 
administration, etc.) indicate the 
which the examinee’s interests resemble those 
of students who have selected these fields. The 
blank consists of 300 items, selected from 900 
tried experimentally, which describe activities 
typical of the areas of study. 
made on an IBM answer sheet, which may be 
scored by hand or machine. 


extent to 


Responses are 


Saldwir 
Educa 
Minn., 


Inductive Reasoning Test, by G. B. 
Grades 9-12. 1 form. 20 (25) min. 
tional Test Bureau, Minneapolis, 
1946. 

This 40-item number series completion test 
is printed on an IBM blank. Reliability is .93, 
and correlations of from .52 to .85 with ranks 
in high school mathematics classes are report 
ed. Intended for high school guidance use, this 
test is part of the “Planning for Life” series, 
of which other tests have been published pre 
viously. 


Mental Health Analysis, by L. P. Thorpe, W. 
W. Clark, and E. W. Tiegs. 4 levels: grades 
4-8; 7-10; 9-college; adult. 1 form at each 
level. Untimed, (45) min. California Test 
Bureau, Los Angeles, Calif., 1946. 

This series of personality questionnaires con- 
tains 200 items at each level. It yields two 
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NEW BOOKS 


principal scores, mental hygiene assets and lia- 
ilities, and ten subscores (assets: close per- 
onal relationships, interpersonal skills, social 
articipation, satisfying work and recreation, 
sdequate outlook and goals; liabilities: behav- 
oral immaturity, emotional instability, feelings 
»f inadequacy, physical defects, nervous mani- 
estations). Scoring may be accomplished by 
and or by machine. The virtues of this blank 
sre its emphasis on assets as well as short- 
omings, and its well-worded items, many of 
which seem sufficiently disguised to penetrate 
uperficial ego-involvements. The deficiencies 
pf the questionnaire are numerous. Although 
he total scores have fair reliability (.89 to 
97), almost all of the suggested interpreta- 
ions use the subtest categories, which are of 
nknown and probably low reliability. On 
me categories, a probably insignificant dif- 
ference of 3 score points may change a child 
from the 50th to the 99th, or from the 10th 
to the 50th, percentile. The intercorrelations 
f the 10 categories are not given, so that 
there are no data by which to judge their in- 
jependence or relationships as factors. More 
erious than these technical considerations is 
the question of the real contribution of such 


ijuestionnaires to individual] mental] health. 


Minnesota Rate of Manipulation Test. Revised 

manual, edited by G. L. Betts. High school 
to adult level. Individual test, 1 form. Edu- 
cational Bureau, Minneapolis, Minn., 1946. 
The new manual for the well-known Minne- 
ita Rate of Manipulation Test summarizes 
in convenient form much of the work that has 
en done on the test since its introduction in 
933. The disk-board used in the administra- 
ion of the tests has been modified somewhat. 
In addition to the two original tests, Placing 
and Turning, three new tests using the same 
juipment are described, the Displacing test, 
he One-hand Turning and Placing test, and 
he Two-hand Turning and Placing test. Data 
are presented on validity, reliability, and in- 
ercorrelations. Norms for the five tests are 
iven in percentiles, standard scores, and 
tanines. The “stanine,” a useful 9-point stand- 
ard score originated in the Air Forces, is here 
riven what is probably its first civilian appli- 
ation. 


School level. 1 form. Untimed. Educational 

Test Bureau, Minneapolis, Minn., 1946. 

This is an interest inventory, scored for 14 
Breas of interest, each of which is represented 
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ous and somewhat complex system of adminis- 
tration, each response of an examinee is a 
choice among four of the items, of which two 
are chosen as preferred activities. Responses 
are recorded on an IBM answer sheet, which 
may be hand- or machine-scored. Retest reli- 
abilities of the curriculum interest areas (9 
items each) range from .87 to .96, and of the 
social interest areas (5 items each) from .64 
to .87. Validity is indicated by students’ cur- 
ricular choices, and by the item-analysis pro- 
‘cedures used in the blank’s construction. Stand- 
ard score norms for each area are based on 
1119 eleventh grade pupils. 


Oseretsky Tests of Motor Proficiency, by N. 
Oseretsky. Portuguese adaption by M. I. L. 
da Costa, translated by E. J. Fosa. Edited 
by E. A. Doll! Ages 4-16 years. Individual 
test, 1 form. Manual, record blank, materi- 
als. Educational Test Bureau, Minneapolis, 
Minn., 1946. 

The translation of the manual and materials 
of the Oseretsky Motor Tests makes available 
to American research a tool for measuring mo- 
tor maturation that has been compared with 
the Binet-Simon tests in the intellectual sphere. 
First published in Russian in 1923, the scale 
has received some attention in Europe, but 
little in the United States. The tests are or- 
ganized by age levels from 4 to 16, with 11-12, 
13-14, and 15-16 combined as two-year levels. 
At each stage there are 6 tests, which meas- 
ure, respectively, general static coordination, 
dynamic coordination of hands, general dy- 
namic coordination, motor speed, simultaneous 
voluntary movements, and synkinesia (associ- 
ated involuntary movements). At the higher 
levels, some tests differ for boys and girls. The 
manual gives directions for the administration 
and evaluation of each test. Ingenious small 
drawings clarify the nature of the tasks, but 
some of the directions are defective and will 
have to be revised by further studies. The 
final scoring is by age levels, and gives both a 
total “motor age,” and a profile of the six as- 
pects measured. In their present form, no 
claims are made that the Oseretsky test norms 
are appropriate for American children, or that 
the test is a finished clinical instrument. It is, 
however, a challenging basis for research in 
an important and hitherto neglected area. 


Standard Profile Chart, by C. E. Jurgensen. 


Educational Test Bureau, 
Minn., 1947. 


The chart form is arranged for plotting the 
profile of an individual on any eighteen tests, 


Minneapolis, 
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his scores being expressed in percentiles, stand- 
ard (T) scores, stanines, or broad verbal cate- 
gories. Adaptations of the chart to group rec- 
ords, merit ratings, interview records, job eval- 
uations, and placement, are discussed in the 
manual. 


Test For High School Entrants, by L. D. and 
A. Crow. 1 form. 40 (50) min. Acorn Publish- 
ing Co., Rockville Centre, N. Y., 1947. 

This survey test consists of four parts, Eng- 
lish, reading comprehension, arithmetic, and 
general information, each with a 10 minute 
time limit. Reliability of the whole test is given 
as .94; that of the parts, from .91 to .97. Uses 
for promotion and for high school classifica- 
tion are recommended, but no data are 
on the test’s validity for these purposes. Cor- 
relation with a reading test is given as .94, 
and with a group intelligence test as .84. The 
percentile norms, on over 5,000 cases are for 
“high school entrants.” Since this might mean 
seventh, ninth, or tenth grade students, the 
applicability of the norms in questionable 
Vineland Social Maturity Scale, by E. A. Doll. 

Ages, 0-25 + years. Individual test, 1 form. 

Manual, record blank. Educational Test Bu- 

reau, Minneapolis, Minn., 1947. 

The Vineland Social Maturity Scale was 
first published in an experimental edition in 


given 


JOURNAL OF CONSULTING PSYCHOLOGY 


1935, and now appears for general use after 
several revisions and a wealth of research stud- 
ies. The Manual’s bibliography of 59 reje. 
ences, “representative rather than exhaustive” 
attests to the wide interest in the scale and the 
many contributions to its development. In jt, 
present form, the scale consists of 117 items, 
ranging in developmental level from the fing 
year of life to over twenty-five years. The 
scale is neither a test nor a questionnaire, by 


a series of behavioral observations, ranging 


through “does not drool,” “eats with fork,” 
” “makes telephone calls” 
“buys all own clothing,” to “advances general 
welfare.” The user of the scale is enjoined tp 
obtain in detail the facts on which to base the 
scoring of each item. Herein lies a potential 
weakness, not of the scale itself, but of thoge 
who may use it to evaluate inadequate or up. 
reliable observations and reports. The items 
are subdivided into categories: self-help (gen. 
eral, eating, dressing), locomotion, occupation, 
communication, self-direction, and socialization, 
which permit some diagnostic use. The final 
rating is a point score of the number of items 
passed, which is converted to a Social Age by 
means of a table. The scale is a valuable in- 
strument for psychologists who recognize the 
wide value of developmental studies, including 
aspects of personality in addition to intellect, 
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